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FOREWORD 



The Proceedings of the CUPM Geometry Conference 
will be issued in three parts; 



I Convexity and Applications (Lectures by 
Griinbaum and Klee) . 

II Geometry in Other Subjects (Lectures by 
Gleason and Steenrod) . 

Ill Geometric Transformation Groups , arid Other 
Topics (Lectures by Coxeter , and others) . 



The texts printed here are based on recordings 
made of the lectures and the discussions, and were 
prepared for publication by the assistants (Hausner, 
Reay, and Yale). The lecturers themselves were able 
to make minor changes and corrections on the final 
sheets, but an early deadline prevented major revision 
or extensive polishing of the texts. The typing for 
offset was done by Mrs. K. Black and the figures were 
prepared by Mr. David M. Youngdahl, 

Lincoln K. Durst 
Executive Director, CUPM 
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INTRODUCTION 



A Geometry Conference, sponsored by the Committee on the Undergraduate 
Program in Mathematics of the Mathematical Association of America, was held on 
the Santa Barbara campus of the University of California from June 12 to 
June 30 , 1967 , 

The conference had its genesis in a meeting of geometers which was called 
by P. C. Hammer to consider the status of geometry in our schools and colleges. 
The meeting was held in Chicago on January 27, 1966 and was attended by more 
than twenty mathematicians including R, D, Anderson, Chairman of the Committee 
on the Undergraduate Progiram in Mathematics, and E, G, Begle, Director of the 
School Mathematics Study Group. It was generally agreed that the immediate 
focus of the problem of geometry lay at the undergraduate level (although its 
solution was related to questions involving the graduate curriculum and pre- 
college mathematics) and that the problem should be referred to CUPM. In May, 
CUPM authorized a small conference to consider the question further. A meeting 
was held in the fall of 1966 at UCLA, attended by S. S. Chern, L. K. Durst, 

P. C. Hammer, P. J. Kelly, V. L. Klee, Jr., W. Prenowitz, N. E. Steenrod, with 
Anderson as chairman. After preliminary discussion of the problem, Chern pro- 
posed a summer conference of undergraduate mathematics teachers who would attend 
lectures on various aspects of geometry, analyze the material presented, and 
examine its relevance to the undergraduate curriculum. Steenrod suggested that 
in addition lectures be given on the geometric underpinning of other branches 
of mathematics. This new and experimental format, which would involve features 
of a summer institute and a *seminar on curriculum, quickly received approval of 
the group and of CUPM itself. Anderson appointed a Planning Committee consist- 
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of Hammer, Kelly and Klee, with Frenowltz as chairman. 

The Planning Committee proposed a conference of four weeks duration but 
consented to a reduction to three weeks for fiscal and other reasons, despite 
misgivings that there might not be enough time for a natural development of 
currlculiun discussion. Twenty-one college teachers were chosen as participants 
by Invitation, since the experimental nature of the conference did not require 
a large membership. 

The following mathematicians lectured for a period of one or more weeks: 

H. S. M. Coxeteri A. M. Gleason, B. Grunbaua, V. L. Klee, Jr., N. E. Steenrod. 
Shorter series of lectures were given by H. Busemann, G. Culler, F. C. Hammer, 
P. J. Kelly and W. Frenowltz. Each lecture was followed by a discussion period 
this helped to contribute a freshness of spirit to the discussion since ques- 
tions, remarks and challenges did not have a chance to be forgotten or lose 
their cutting edge. Several discussions on curriculum were scheduled as the 
need arose . The program was supplemented by the showing of several films pro- 
duced by the College Geometry Project of the University of Minnesota. 

An important innovation was the selection of three assistants to the 
lecturers, younger mathematicians who were responsible for writing up the 
lecture notes and leading discussions on the material and its relation to the 
undergraduate curriculum. M. Hausner, J. R. Reay, and P. B. Yale were chosen 
for these assignments and carried them out with singular dedication. 

The text which follows gives a record, sometimes in summary form, of the 
lectures and discussions of the conference. 

Walter Frenowltz 
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PROCEEDINGS 




APPLICATIONS OF GEOMETRY 
Lectures by Victor Klee 

(Lecture notes by Melvin Hausner and John Reay) 



Lecture 1 , 

During the past few years, we have witnessed the effects of the many 
efforts to revise and improve the mathematical curriculum (SMSG, CUPM, etc») • 
These efforts have proved to be widely influential and beneficial much more 
so than we might have imagined when they were initiated. However, the possi- 
bility of further progress should not be ignored. In particular, an informal 
session at the 1966 Chicago A.M.S. meeting made it clear that many geometers 
favor a broad further revision of the geometry curriculum from the kindergarten 
through the senior undergraduate level. Since such a broad attack was clearly 
not feasible, it was decided to start at the college level. Hence we are 
gathered here. 

Let us consider the following four questions, 

1, Is collegiate geometry instruction in a state of decline? 

2, If so, is this undesirable? 

3, If this is undesirable, what should be done to improve the situation? 

4, What should a college geometry course consist of? 

Our assumption is that questions 1 and 2 have affirmative answers and 
that questions 3 and 4 are only slightly different. Once question 4 has been 
answered, the next step will be to write the appropriate books. 

(It is profitable to compare the situation of geometry with that of 
algebra. In algebra, thousands of students have learned from the books by 
van der Waerden or from Birkhoff and MacLane or their imitators. Hence the 
algebraists have a common experience which serves to strengthen their suoject. 
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I do not believe the universality of these books was a result of g;eneral agree- 
ment on what belonged in an algebra course at the time they were written. 

Rather, it occurred because the books were written so well that their readers 
came to think that this was the only natural way to proceed.) 

Vhat are the desirable characteristics of an undergraduate collegiate geom- 
etry course? In my opinion, they should include the following points. Except 
for the first and third, these apply as well to any college mathematics course. 

a) The course should be n- dimensional, and even infinite- dimensional if 
this is possible with little extra cost. Anything less will be a letdown for 
the student and will minimize the usefulness of the course. 

b) There should be a unifying theme throughout the course. For example, 
it might be the study of certain objects, such as differentiable manifolds or 
convex bodies. Or it might be some notion, such as invariance or symmetry. 

c) It is more important to study the geometric objects, their structure 
and their properties, than to have an esthetically pleasing axiom scheme. One 
should use the most powerful approach rather than the most esthetic one. Thus 
the axiomatic basis should be specifically "geometrical" in nature only if no 

loss of efficiency results from this. 

d) It should emphasize the points of contact with other areas of mathe- 
matics . 

e) It should include applications to science and technology wherever 
possible , 

f) Unsolved problems should be mentioned in order to whet the students* 
interest • 

Subjects which might satisfy these criteria for a geometry course are 
differentiable manifolds, algebraic topology, n-dimensional projective geometry. 
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and the geometry of convex bodies. There are of course several other possibil- 
ities , 

My candidate is a course on convex bodies. As far as point a) is concern- 
ed, the subject is easily carried out in n dimensions. As Grunbaum pointed 
out, many of the methods used for n-space are essentially two- or three- 
dimensional, making motivation and intuition very easy in n dimensions even 
for the inexperienced. As to point b) , the unifying theme is the study of the 
properties of convex sets. This is very pictorial, and close in spirit to the 
familiar Euclidean geometry, but it makes much more contact with modern mathe- 
matics. The method which seems to fit the criterion of point c) is to use 
real vector spaces as the setting for the theory. However, if Prenowitz*s 
"join geometry" approach can be developed in more detail, it might very well 
be used instead. The possibilities for point f) are excellent (cf . Grunbaum* s 
lectures) , Finally there are many connections of convexity with other areas 
of mathematics, and many applications in science and technology. Let us 
mention just a few, 

1. In functional analysis, three of the most important tools are separa- 
tion theorems for convex sets (the basic separation theorem being equivalent 
to the Kahn- Banach theorem) y extreme-point theorems for convex sets, and fixed- 
point theorems for convex sets. Inclusion of these topics in a convexity 
course serves simultaneously to teach the student some interesting geometrical 
facts and to prepare him for a later course in functional analysis , 

2, In a very natural way, a number of topological notions can be brought 
into a course on convexity. For example, the Euler characteristic can be 
approached in a combinatorial way based on convex sets. Having defined 
simplicial subdivisions and proved that a simplex admits subdivisions of 
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arbitrarily small mesh, then by using Ky Fan*s combinatorial lemmas (taking 
perhaps one hour), it is possible to prove (perhaps in two hours) such basic 
results as Brouwer *s fixed point theorem, the Borsuk-Ulam antipodal mapping 
theorem, the invariance of domain theorem, the Lebesgue tiling theorem, etc. 

One can then proceed to use these results in obtaining further properties of 

convex sets , 

3. In a study of the combinatorial properties of convex sets, various 
combinatorial identities enter in a natural way and of course there are many 
contacts with graph theory. At present, matroids do not seem to arise 
naturally in such a course, but techniques now being developed will probably 
change this situation in a few years . (Jon Folkman has a new combinatorial 
notion which is related to positive bases as matroids are related to linear 

bases .) 

4, In inequality theory, many results are best viewed from the point of 
view of convex functions. There are applications of convexity in summability 

theory and in many other parts of mathematics . 

There are many areas of modern applied mathematics which use convexity 

in a significant way. Some of them are listed below. 

5. In linear programming theory, the study of convex polytopes forms the 

essential geometric background, 

6, In ntuch of nonlinear programming theory, convex or concave functions 
form the background, 

7, Control theory uses the separation theorems, as well as Liapounoff's 
theorem asserting ther convexity of the range of a vector measure, 

8. The theory of pattern classification uses separating hyperplanes in 

a fundamental way, and in particular SchlSfli's theorem on the number of regions 
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determined by a finite set of hyperplanes in general position through the 
origin. 

9. Parts of information theory are closely related to packing problems for 
convex bodies. 

I readily confess that I have not attained a sufficiently high degree of 
organization to cover all the above topics in a one-year course, but I believe 
it to be possible. The unifying theme makes this less of a hodge-podge than 
it appears to be. 



Discussion . 

The question of a unifying theme was brought up. In particular, Klee 
admitted that this was not so apparent in Birkhoff and MacLane, although the 
notion of a group permeates this text. In this connection Klee felt that a 
course based on the notion of transformation as a unifying theme would be too 
bland in nature, with too few deep or exciting geometrical results. 

Point c) of the ideal geometry course was discussed in some detail. This 
led to the general question of whether the course should be content-oriented 
or foundation-oriented, with Klee and most of the vocal participants choosing 
the former. However, when it was suggested that a mathematics course without 
axioms is simply not a mathematics course, Klee stressed that he did not wish 
to eliminate the axioms--he wanted to use an efficient axioms system and not 
concentrate on esthetics. In any event, the underlying axioms would be stated. 

When the question of the importance of n-diraensional space was brought up, 
it was pointed out that a junior or senior level course was under discussion 
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and that this should not cause problems at that level. 

Finally, there was general agreement that a year's course on convex bodies 
should not be the sole upper level geometry course. It should be one of 
several. But Klee felt that one geometry course which has some depth is better 
than the customary hodge-podge course. 



Lecture II « 

We shall now discuss some applications of convexity which might be in- 
cluded in the sort of course Grunbaum is discussing. These are not applica- 
tions in the strictest sense of the term. Rather, they illustrate the sort of 
•'pure applied mathematics’* in which one takes certain objects which come from 
applied mathematics and then one studies them for their own sake, just as one 
would any other mathematical object. (Note that much of mathematics has 
developed in exactly this way.) 

We now introduce a few mathematical notions which form the necessary 
background for the material. These will be elaborated in Griinbaum's lectures. 

Definition, An extreme point p of a convex set C is a point p of C 
such that C [p] is convex. (We use the notation A B for set- theoretic 
difference.) Equivalently, p is an extreme point of the convex set C if p 
is in C but is not the mid-point of any segment joining two points of C. 

For example, the extreme points of a plane convex polygon are the vertices 

of that polygon. 

Extreme points and techniques for manipulating them constitute one of the 
three aspects of convex sets which are most important in connection with 
applications of convexity to other portions of mathematics. (The other two 
are separation theorems and fixed-point properties.) The importance of 
extreme points is due in part to the following fact. 

Theorem . If K is compact, convex, and finite dimensional, then K is 

the convex hull of its extreme points; that is, K « con ext K, 

(This result goes back to Minkowski. In the corresponding infinite- 
dimensional result of Krein and Milman it is necessary to take the closed 
convex hull of the extreme points.) 
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If we abandon the restriction of compactness, we have the following 



analogous result. 

Theorem, If K is closed, convex, finite-dimensional, and line -free 
(no line is contained in K) , then K is the convex hull of its extreme points 
and its extreme rays; that is K = con (ext K U K) , (An extreme ray of a 
convex set K is a half line contained in K which is not crossed by any 
segment in K.) 



Another important reason for studying extreme points is in their applica- 
tion to a wide variety of practical optimization problems. For example, 
suppose one has a cost function which is to be minimized. In many cases, this 
function f is defined ‘over a convex set C and la many applications f is 
affine, or at least concave, A basic property of such functions is as follows. 
Theorem . If a concave function attains its minimum on a line-free closed 
convex set, it does so at an extreme point. 

For the above result, as for the remainder of these lectures, we are 
assuming that the convex set is finite-dimensional. 

We would like to indicate a more or less constructive method of obtain- 
ing this minimizing extreme point, starting from an arbitrary minimizing 
point. Let us consider for the sake of simplicity a compact set, and 
let us first consider the simpler problem of finding an extreme point. 





A ray which is not extreme 



An extreme ray 
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Grlinbaum will probably describe a quick inductive proof of the existence of 
extreme points y but the following procedure is more constructive in nature. 

In the diagram, we start at any point x^^. If it is not extreme, it is 
the mid-point of some segment. Extend this segment in one direction as far as 
possible, to a boundary point x^. We continue the process with and in this 

way we obtain a sequence x^^,X2»... of points. We claim that this procedure 
must terminate after. at most d +1 steps, where d is the dimension of the set. 




To see this, note that the point x^ cannot be on the line ^c^x^, by the 
special choice of X2. We claim similarly that x^ cannot be in the plane 
x-x„x„, and so on. The reason is indicated in the following fxguie, if it 

JL M 

X, 




were possible to continue along the line L in the plane x^^x^x^, then, since 

we are in a convex set, the shaded triangle would be in that set. This implies 

that x-x„ can be extended beyond x„ which is a contradiction of the choice 
12 ^ 

of X2» Similar arguments hold for higher dimensions. Of course, the pictor- 
ial argument can be made formal. Now, if f is a given concave function, 
assumed continuous for the purposes of illustration, then a similar process may 
be applied to find its minimum. For if the minimum is attained at x^^, and if 

X is not extreme, it is an immediate consequence of the notion of concavity 

1 
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that the minimum is also attained at one or the other endpoint of any segment 
through x^, say at X2« We may then continue as above to prove the stated 
theorem. 

The required definitions of concavity and convexity, along with the pic- 
torial idea are as follows. 

Definition . Let C be a convex set, and let f be a real-valued func- 
tion on C, Then 

1) f is concave if f(\x + (l-\)y) § + (l-\)f(y) for all x and 

y in C and all X between 0 and 1: - — n 

2) f is convex if the above inequality is reversed. Equivalently, f is 

4 

convex if -f is concave: 

3) f is affine if it is both convex and concave; the above inequality 
is replaced by an equality: 

These formulations translate the usual geometric notions. Thus, f is 
concave, convex, or affine if the graph lies respectively above, below, or on 
any chord. 

It is clear why the minimum of a concave function on a line segment must 
be attained at an endpoint. For if not, we would have a graph with points: 



which is clearly not possible for a concave function. 

We now consider a specific class of minimization problems, namely, the 
transportation problems. Such a problem (an m x n transportation problem) 
is specified by a positive m-vector a « (a^,..,,a^), a^ > 0, and a positive 
n-vector b * ^i ^ where a^ » b^. We suppose that 

we have a homogeneous, infinitely divisible commodity located at the sources 
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A , ,A . The amount a. of the commodity is at the source A.. The problem 
1 m 1 ^ 

is to transport this commodity to the destinations (or sinks) in such s 

way that the total amount of the quantity arrives at . A solution of 

this problem is an m X n matrix x = » where x^^ is the amount of the 

material from source A^ which is sent to the sink B^. Thus, the solution 
X is a matrix which satisfies the conditions; 



*11*12 • • • 


a 

1 


1. 


*ij^° 




• 

• 


a 


2. 


^-1 *ij “ 


(row sum) 


b, b 

1 n 


lU 


3. 


^»1 *lj ’ *’j 


(column sum) 



The rectangle is a convenient pictorial device to help us remember the size of 
X and the row and column conditions. The solutions, as a set of m X n 
matrices, form a subset of Furthermore, the defining equations and in- 

equalities for X show that the set of such x*s is a convex polytope. To 
see this, *iote that each of the mn inequalities x^^ — 0 defines a closed 
half space, and each of the equalities determines a hyperplane which is the 
intersection of the two closed half spaces determined by that hyperplane. Thus 
the set of matrices x satisfying the required conditions is an intersection 
of finitely many closed half spaces, namely a polyhedron. It is clearly bounded, 
since each component is bounded by Thus it is a polytope. 

Definition . If a and b are vectors as above, the set of matrices x 
satisfying the above conditions is called the transportation poly tope T(a,b) • 
Now suppose there ate cost functions c^^ , where c^^ (of) Is the cost of 
sending an amount oi from source A^j^ to sink Bj • Then the cost of a 
particular transportation scheme x is simply 

C(x) S c^j (x^j) • 
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In most practical problems the functions c^^ will be concave, whence C is 
also concave. When the c *s are linear this is an example of a problem in 

ij 

linear programming, namely to minimize a linear function over a convex poly- 
hedron. We know that the minimum is attained at some vertex of the polytope. 

(A vertex is an extreme point of a polyhedron.) It is also easy to see that 
the set of minimizing points is the convex hull of the set of minimizing 
vertices. The subject of linear programming involves various algorithms to 
find these minimal vertices. We should like to point out that sometimes this is 
not' a simple problem even though there are only finitely many vertices. The 
number of vertices may be very large, so that a direct search is impractical. 
Further, the polyhedron is generally not given in terms of its vertices but 
only as the solution set of a system of linear inequalities. 

It is not the purpose of these lectures to discuss practical methods of 
solving linear programming problems. However we can sketch the practical 
technique usually employed. First find one vertex, (Often this is not 
trivial.) There are computationally practical ways whereby, having a 
vertex of the polyhedron, one can "look" at all of the adjacent vertices. 

Of the adjacent vertices, choose the one which gives the smallest value 
to the cost function or which in some other sense represents a maximum 
improvement. Then continue the process. It must terminate at a minimiz- 
ing vertex, as follows from a simple theorem in the geometry of polytopes. 

This is all for linear cost functions. For concave functions the situation 
is much more complicated . 

Although linear programming problems are concerned with finding an 
optimizing vertex of the feasible region, there are closely related problems 
in which one may want to know all of the vertices. An example would be a 
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transportation problem in which an enemy is choosing the non-negative coeffi- 
cients c. . of the linear cost function (subject to S c, , = 1) in the 
hope of maximizing C(x) . 

We now study the geometry of T(a,b). We precede this sequence of results 
by the remark that the subject itself is not of overwhelming mathematical 
interest. It is given simply as a sample of what could be done, in a course 
treating convex polytopes, to increase the students' contact with the more 
applied aspects of the subject. 

Theorem 1 . dim T(a,b) = (m-1) (n-1) 

We prove this result in several steps. 

a) The equality constraints (the row and column conditions) define a flat 
of dimension (m-1) (n-1). Each of the row and column sum conditions defines a 
hyperplane. The claim is that the intersection of these hyperplanes has the 
required dimension. Momentarily, the inequality constraints are being ignored. 

To prove a), we note that there are nri-n linear functions here. Let, for 

example, p^(x) = x^^ (= i— row sum) and column 

sum) . These m+n linear functions are not linearly independent, since 

S p = S V . However, if we delete v j the remaining m + n - 1 linear 
i i j ’j 'n 

functionals are linearly independent. (For example, we may easily arrange to 
make any one of these equal to 1, while the others equal 0.) Thus the rank 
of this system of linear functions equals m + n - 1 and the dimension of the 
flat in question is mn - (iiri-n-1) = (m-1) (n-1) . 

(Remark. Using the theorem of Caratheodory , this result shows that each 
feasible transportation scheme (point of T(a,b)) is a convex combination of 
(m-1) (n-1) +1 or fewer vertices. For n X n doubly stochastic matrices, 
the vertices may be identified as the permutation matrices. Thus, every 
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n X n doubly stochastic matrix is a convex combination of n - 2n + 2 or 
fewer permutation matrices.) 

b) For each i and j, there exists a vertex x of T(a,b) such 
that X. . > 0 , 

With no loss ^®^"=^nerality , we may assume i = m, and j = n. Then we use 

the "southeast corner rule," x = min(a ,b ) and proceed at each stage to 

mn m n 

fill in the southeast corner 



a g b 
m n 




n 



with as large an entry as possible; that is 

^ij " “ ^j<csn ^ic’ ' ^i<r^ ^r * 

Then we claim that the matrix so constructed is a vertex. The proof is by 
an easy induction which is left to the reader, 

c) There is a point of T(a,b) which is interior to the positive 
orthant determined by the inequalities x^^ > 0 . 

To see this, let p. . be the mn vertices determined in part b) , Now 
take their average, p = (S. . p..)/mn. Then p is a convex combination of 
vertices, hence an element of the polytope. But p clearly has all of its 
entries positive and hence is in the interior of the positive orthant. 



Finally, the proof of Theorem 1 follows immediately from a) and c) , since 
T(a,b) is the intersection of a flat of dimension (m-1) (n-1) which passes 
through an interior point of the positive orthant. 
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Discussion, 



The discussion clarified the final part of the proof of Theorem 1. In 
addition, the formal definitions of convex and concave functions were pre- 
sented, as given in the notes. The importance of these functions were stressed 
by Klee, insofar as practical maximum and minimum problems were concerned. For 
example, the minimum of a concave function is always attained at an extreme 
point, while for convex functions, a local minimum is a global minimum. 

Problems of minimizing or maximizing nonconvex and nonconcave functions tend to 
be much more difficult, 

Klee pointed out that transportation problems are a real application, not 
a contrived one. In particular, a significant portion of present day computer 
time is devoted to the solution of linear programming problems. 

The question of computing volumes of polytopes was brought up as a 
practical one in geometric probability. The difficulty even for dimensions 
2 and 3 was indicated. Two-dimensional problems can be done with some in- 
genuity, and higher dimensional problems with a large amount of symmetry can 
also be handled. But otherwise, nobody can do it, Klarakin pointed out that 
Polya, in his thesis, considered the problem of finding the volume cut off by 
two parallel hyperplanes in a cube , Explicit formulas were given and were 
derived by Laplace transforms. 
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Lecture 111 . 

We continue the discussion of the properties of transportation polytopes 
with a theorem about the number of facets of T(a,b). (A facet of a d-dimen- 
sional polytope is a face of dimension d-1. In the study of polytopes, the 
faces of extreme dimension (namely the facets and the vertices) are usually 
the most interesting faces.) The theorem is not of any great importance as 
far as applications are concerned, but in proving it we will learn things 
about the structure of the problem which are of interest for the applications. 
Assume from now on that m ^ n. This is really not an essential limitation, 
since mathematically there is no distinction between sources and sinks. 

Theorem 2. T(a,b) has exactly one facet when m = 1 and two facets 

when m == n = 2. The number of facets is otherwise between (m-l)n and mn. 



each possibility being realized by some transportation polytope. 

Proof. The first two cases are trivial. We prove a number of statements, 

each of which is easy by itself. 

a) Each facet of T(a,b) is of the form [x ^ T(a,b) ] a particular 

coordinate x = 0} . (This does not assert that each set of this form is a 
ij 

facet. Note that this does give the upper bound on the number of facets.) 

T(a,b) is the intersection of a flat with the positive orthant, but this 
orthant itself has exactly mn facets. Thus it suffices to prove that when 
P is a polyhedron, V is a flat, and F is a facet of P fl V then F is 

the intersection of V with some facet of P . P 

(For example if P is a cube, V a plane 
cutting its middle, the facets of P Pi V 
are the four edges of the (middle) square.) 

The proof uses a characterization of facets 
F of a polyhedron as maximal convex subsets 
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of the boundary of the polyhedron. 

b) A set of the form in a) is a facet if and only if its dimension 

is (m-1) (n-1) - 1. 

This is evident, since each set of this form is the intersection of the 
polytope (of dimension (m-1) (n-1)) with a supporting hyperplane. 

Thus we want to decide for which pairs (i.j) does this form give a facet 

of the polytope. 

0 ) Let <J = s“. a = 1’-. and fix (i.j). Note that a is the 

total amount of the commodity shipped, while a^ + is the sum of the 

amount shipped from source and the amount received by the sink . 

CT<a +b iff lx €T(a,b) 1 X,, = 0] misses T(a,b) 
i j 

o = a +b, iff Ixjx . = 0] determines a vertex of T(a,b), 
i J 

a > a^ + b iff “ 0} determines a facet of T(a,b). 

This gives a relation between the positive orthant and the flat determined by 
the set of a) . In general the Intersection of a supporting hyperplane with a 
poly tope may be a facet or a vertex, or a face of some Intermediate dimension. 

But here the intermediate possibilities cannot occur. 

Rather than prove this, we consider a picture of the situation. Each 
point of T(a,b) is a matrix x. as shown. Assume (for notational simplicity, 

and without loss of generality) that 

(i,j) ® (m,n). The question is: What can 

we say about matrices x in T(a,b) with 

X = 0? If + b > a, i.e., 
mn m n 

b > a - a « a , there can be no 

n m J-** ^ 

non-negative x in T (a »b) for which 

X s 0, for any such x would give b - ®i* 

mn 




a. 



a 



m 
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a + b « a it follows that all elements in the last row and column must be 
m n 

as large as possible. Thus the rest of the matrix must be zero, i.e., the 

hyperplane meets T(a,b) at exactly one point, so this point is a vertex of 

the poly tope. If % + entries in the last row and column may 

be chosen less than the maximum amount (e.g., x^^ < a^, etc.), and are sub- 

jeot only to the two restrictions and = a^. Thus 

(within restrictions) we have a "free choice" of (m-2) (n-2) elements of the 

reduced matrix and (m-2) and (n-2) free choices in the last row and column, 

i.e,, (m-2) (n-2) + (m-2) + (n-2) - (m-1) (n-1) - 1 free choices. Thus 

fxlx * 0} determines a facet of T(a,b). 

' mn 

Note that this result is of interest because if an opponent is choosing 

the cost function make use of the following fact; if 

a + b > cr then clearly every feasible scheme must send something from 
i j 

source to sink • 

d^ If a a + b and a g a + b , with r ^ u, s v, then 
/ r s u V 

m M n 2. That is, if we have two critical points in distinct rows and 
coliUDhs of X, then m * n * 2. 

This is clear since a^ S a - b^ » b^ S b^ S a - ** a^ S a^, 

with the second and fourth "S” being only if respectively n « 2 

and m » 2. Thus all these "bad points” (ijj) must be in the same row or 
in the same column, and there can be at most n of them (since m g n) . 

But these points (i,j) are precisely the places in the m X n matrix where 
{x € T(a,b) I X « 0} is not a facet, by part c) . Thus at least 

Ij 

mn - n ■ (m-l)n of these sets must be facets of T(a,b). This proves the 
theorem except for showing that all possibilities between (m-l)n and mn 
are actually realized. 
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We now turn to the number of vertices of T(a,b). We first characterize 
the vertices in a useful way. This characterization will not be explicit 
enough to allow us to count them in general. Leading up to thiSy I would like 
to mention a standard correspondence between matrices and bipartite graphs 
which will be useful in some of the counting processes. 

Associated with any m X n matrix we have an undirected graph whose 
nodes or vertices are divided into two sets, one set of nodes corresponding 
to the rows of the matrix, the other set of nodes corresponding to the columns 
of the matrix. We connect the i-~ row node to the j— ~ column node if and only 
if the corresponding entry in the matrix is non-zero. For example, this 
matrix and graph correspond. 



1 6 0 0 0 0 

0 3 5 8 0 0 

0 0 9 2 7 0 

0 0 0 0 0 4 



These graphs are associated with the transportation problem in a natural 
way, since we can think of the row nodes as sources and the column nodes as 
sinks. The arcs may then be labeled with the amount of shipment along that 
arc. We will prove that the feasible transportation schemes (i.e., points 
X € T(a,b)) which are vertices of T(a,b) are exactly those whose graphs 
are trees or ’’forests” (unions of a finite set of disjoint trees) , 

Definition. A loop for the matrix x is a sequence (ij » j^^) (i^^j 

successively change only one of the two numbers 




(rows) 



(numbers in each node are 
row or column sums) 




25 



1 or j , such that 



a) there are no repetitions, and 

b) the matrix has non-zero entries in all the indicated places , 
Equivalently, a matrix has a loop if and only if its graph has a circuit, since 
the only way you can leave a ”row” node is to go to a "column” node in a bi- 
partite graph, and to leave a .column node you must go to a row node. Thus the 
following theorem has a geometric as well as graph- theoretic interest. 

Theorem 3 . The vertices of the polytope T(a,b) are exactly those 

X € T(a,b) such that the matrix x admits no loop. 

Proof . Suppose x 6 T(a,b) and x has a loop 

(i- , j ), Set 2e = min fx^ . I (i,j) listed] > 0, Define a matrix y by 

is ij 

y s X . . if (i,j) is not listed in the loop, 
ij ij 

yji " 4 ^ ® (i>j) loop 

where e is alternately added and subtracted as we go around the loop. This 
has the effect of leaving all row sums and all column sums fixed and hence 
y, %(x+y) , and %(x-y) are all member.s of the convex polytope T(a,b), But 
X * i(x+y) + iCx-y) and thus x is not an extreme point. 

Conversely, if x € T(a,b) ^ ext T(a,b) we will show that the matrix x 
has a loop. If x is not a vertex of T(a,b) then there is some y such 
that X db y 6 T(a,b) , and the row sums and column sums of y must necessarily 
all be zero. Now choose y . ^0, Since all row and column sums of y are 

hh 



zero we can continue to choose pairs (i>j) which define a loop in y for 
which the corresponding y^^ are non-zero. Since x+y and x-y, being in 
T(a,b) must have all non-negative entries, it follows that each x^j must be 
positive and we thus have a loop in matrix x. This completes the proof. 
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Theorem 4 . For any matrix, the following are equivalent: 

a) X admits no loop 

b) X € T(a,b) is a vertex of T(a,b) 

c) every submatrix of x (obtained by deleting certain rows and columns) 

admits a distinguished row or column. (Definition: A row or column is 

distinguished if it has at most one non-zero element.) 

d) any r X c submatrix has at least r - c + 1 distinguished rows 

when r S c (distinguished columns when c 5 r) . 

e) every square submatrix has a distinguished row or a distinguished 

column 

f) every square submatrix has both a distinguished row and a distinguish- 
ed column 

g) every square submatrix of order k has at most 2k - 1 non-zero 
entries , 

The conditions of this theorem could, of course, be put into gfaph 
theoretic terms. l?or example, c) implies that the bipartite graph of the matrix 
X must have at least one "dead-end." To say a certain row is distinguished 
corresponds to saying that a particular row node is joined to at most one other 
node. Condition e) asserts that if we pick any set of nodes containing the 
same number of row nodes as column nodes . and the arcs which join these chosen 
nodes, then there must be at least one dead-end in that subgraph. Condition 
f) asserts that there is a dead-end tow node and a dead-end column node in 

each such subgraph. 

Hone of these conditions allows us tr count the vertices of a really 
complicated transportation poly tope, although they do allow us to look at 

certain points x € T(a,b) and say immediately that they are not vertices , 
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Discussion. 



Hausner pointed out that there are only finitely many combinatorial types 



of the polytope T(r,b) c and yet the combinatorial structure depends upon 

the continuous parameter (a,b) € E . He asked what determined the boundary 
of a set of points (a,b) € E which gave a transportation poly tope 
T(a,b) c of a particular combinatorial type . (Definition ; Polytopes 

P and P' are of the same combinatorial type if the lattices 3*(P) and 
3(P') of their faces are isomorphic.) Klee answered that this was typical of 
a class of similar questions that could be asked, most of which are hard. 
Perhaps you could prove that two transportation polytopes in had the 

same combinatorial type provided a certain characteristic function 
assumed the same value on the pairs (a,b) and (a',b') which defined the 
given transportation polytopes. The characteristic function x would be 
defined in terms of the class of all subsets of the set of m + n coordinates 
of (a,b). 

Certain game theory results have applications to economics or business, 
but no one suggested any peaceful applications of the game- theoretic trans- 
portation problem in which an enemy is determining the cost function, Klee 
remarked that since all "bad points" of a matrix x (i.e., pairs (i,j) for 
which a < a^ + b. in the notation of Theorem 2) must occur on one row or one 

i j 

column, there will be one critical source or sink in these cases whose lines 
of communication must be closely guarded. 



We will continue our discussion of transportation polytopes, but will omit 
the details of some of the counting arguments. These would be similar in 
spirit to those done above, but considerably more complicated and time-consum- 
ing. 

We first consider the possible number of vertices, A transportation 
problem determined by (a,b) c is called degenerate if the sum of the 

elements of some proper subset of {a^ | 1 ^ i ^ m} is equal to the sum of 
some proper subset of [b^ ] 1 ^ j ^ n] , A problem is degenerate if and only 
if some feasible solution has a graph which is disconnected. 

Theorem For a general m x a transportation problem (m g n) , the 
minimum number of vertices is 

nl 

(n-mrt-1) i 

and this is achieved in certain examples. For m X a non-degenerate problems 

tyimI 

the minimum number of vertices is n " , This gives a complete solution for 
the minimum number of vertices. 

Only partial results are known about the maximum number of vertices of 
the general m X n problem. 

Theorem 6, For m X a transportation problems, the number of vertices 
of T(a,b) is achieved by non-degenerate problems. 

Sketch of proof . Each T(a,b) is the intersection of a positive orthant 
with a flat. This poly tope can clearly be approximated as closely as desired 
by a polytope which represents a non-degenerate problem. One then makes use of 
the fact that the number of vertices of a poly tope is a lower semi- continuous 
function of the poly tope. 



We next consider the maximum number of vertices for regular 



(= non-degenerate) problems , 

Lemma 7, Two vertices of T(a,b) which have the same pattern of non- 
zero entries must be identical . 

Proof. Vertices x and y were characterized as matrices without loops. 
If X and y have the same non-zero pattern, then so must •g(x+y) . Clearly 
this cannot happen unless x = y because |(x+y) must also be a vertex. 

Since every extreme point of T(a,b) has a graph which is a tree, we can 
get upper estimates of the number of vertices by counting certain classes of 

trees . 

Theorem 8 . For any m X n transportation problem the number of vertices 

is at most the number m“'^n“'^ of spanning trees of the complete bipartite 

graph with m vertices in one set, n in the other. 

We can restrict ourselves to (connected) trees rather than forests (i.e., 

unions of disjoint trees) in the above theorem, because we have restricted the 

, , n-1 m-1 . . , 

problem to the non-degenerate case. It should be noted that m n is the 

number of "generalized" vertices of the d-polytope T(a,b) in the following 
sense. Each genuine vertex is the intersection of d hyperplanes determined 
by d-facets. However the intersection of d hyperplanes may or may not be in 
the polytope and thus it might not be a vertex. The above number counts the 
number of such intersections and thus is only an upper bound on the number of 
genuine vertices. One could ask, how close to m n vertices can you get 
with a transportation polytope? At present this seems to be a very hard 
problem, although some progress has been made. 

As an example, suppose we have n sources, each with supply 1/n, and 
n+1 sinks, each with demand l/(n+l). Then several observations may be made 



er|c 
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about the graph of a vertex of this poly tope: 

a) Each source must have at least two arcs leading to sinks, since 

1/n >1/ (n+1) . 

b) Each source must have exactly two arcs leading to sinks, since any 
(connected) tree with 2n+l nodes must have exactly 2n edges. Thus the bi 
partite graph might look like this: 




c) For every bipartite graph which is 2-valent at each source node, 
there is a feasible transportation scheme which may be assigned to the graph 
For the graph shown above the assigned scheme will be 




(where the amount k shown on each arc denotes that k/n(n+l) of the total 
commodity moves alon^ that arc) . 

d) The above arguments would work in a similar way if there were n 
sources, jjbn + 1 sinks, and therefore each source was (p,+l) -valent . 

e) The problem of determining the number of vertices to the transporta- 
tion polytope when we impose these symmetry conditions is again a matter of 
counting trees. The number turns out to be (n+l)"^ ^(nl) when there are n 
equal source nodes and n+1 equal sink nodes, or 

(M.n+1) 
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when there are n equal sources and |j,n + 1 equal sinks. 

Our discussion of transportation polytopes will end with the following 
comment. Much of the material presented above is well-known but some of it 
is new. A much more careful exposition, accompanied by complete references 
and proofs, can be found in a forthcoming paper by V. Klee and C. Witzgall. 



Discussion . 

It was pointed out that the upper bound in Theorem 8 was frequently quite 
a bit too large. Several other problems were mentioned which appear similar 
to the transportation problem, but which apparently are related only in that 
combinatorial techniques are needed. 

The symmetry of the last transportation problem example (n equally 
strong sources, n+1 equally strong sinks) leads to a question about the 
symmetry of the corresponding polytope T(a,b) and the determination of its 
facial structure. Klee mentioned that Griinbaum would consider the problem of 
determining the number of facets of a d-polytope with v vertices, and 
remarked that it would be interesting to have similar results for the class 
of polytopes which had a group of symmetries of a particular order . Grunbaum 
remarked that almost no results of this type are known. Klee pointed out that 
theorems on the facial structure of polytopes are of direct interest to linear 
programming problems. 

Woolf asked how proponents of a geometry course based on convexity, as 
Grunbaum and Klee have discussed, would meet the claim that geometry courses 
based on algebraic topology and differential geometry would be of more use to 
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a student in his further work. Klee was of the opinion that to do something 
well in those two areas, a much better background and level of maturity is 
necessary than to do a careful treatment of convexity. In particular many 
topics in convexity are more intuitively accessible and elementary. If a 
student were going on to research in those areas, then a thorough background 
at the undergraduate level could be preferable, but for an undergraduate 
general training, a course based on convexity would, in Klee*s opinion, be 
better. Paul Kelly pointed out that in planning this conference, it was felt 
that the place college geometry was least neglected was in differential 
geometry, and that it would be better to invite lecturers who would concentrate 
on the other parts of geometry which are in more trouble at the college level. 
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Lecture V. 



We will next consider another topic related to linear programming. 

Perhaps this should be called a problem whose answer would lead to an applica- 
tion rather than an application of geometry. Much of this lecture can be 
found in the last chapters of Griinbaum's book, Convex Poly topes , 

Linear programming problems are frequently solved by choosing a vertex of 
the polytope described by the linear constraints, investigating the neighboring 
vertices, and thus searching from one vertex to the next for the maximizing 
vertex. In such a search process we move along the edges from one vertex to 
the next, and it would be useful to have results which give upper bounds to 
the number of such iterations that a computer might have to perform. (The 
usefulness of a computer in solving problems often depends upon whether you 
know the problem will take at most 2 minutes, 2 days, or 2 years'.) 

Definition . Let G be a graph (for example, the set of edges and 
vertices of a polytope) . The distance 
6(x,y) between vertices x and y is 
the length of the shortest path (counting 
edges) between x and y. The diameter 
of G is max[6(x,y) \ x,y vertices of G] . 

If P is a poly tope, 6 (P) denotes the 
diameter of the graph of P. For example, if P is a cube, then 6(F) = 3. 

Definition. A, (d,n) is the maximum diameter of a d-polytope with n 

' ■ "" D 

facets. (The b is for bounded; if the b is omitted it is the same defini- 
tion for the class of (possibly unbounded) polyhedra.) 

Definition . A graph G is d-connected if it cannot be separated between 
any two vertices by removing fewer than d vertices and their adjoint edges. 
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Eor example, the graph of the cube shown above is 3-connected. (See Grunbaum*s 
lectures, Section 12, for further related results.) 

Before stating the main theorems, we make a few observations. 

Lemma 1 . The graph of a d-polytope is d-connected. Equivalently, for 
each pair of distinct vertices of a d-polytope, there exist d disjoint paths 
connecting these vertices with only the end-points in common. 

The proof is omitted. 

Lemma 2 . (Grunbaum-Motzkin) The maximum diameter of a d-polytope with 
V vertices is 

[^] + 1 . 

Proof. Suppose x and y are vertices. Consider d independent paths 
from X and y and let k be the length of the shortest. Internal to each 
path there must be at least k-1 vertices. Thus v S (k-l)d + 2, so 
k-1 S and k S [(v-2)/d] + 1. 

Cl 

Lemma 3. A, (d,n) is achieved by a simple d-polytope with n facets. 

■ D 

(P is simple if each vertex is d-valent.) 

Proof sketch . A small amount of "wiggling" of each face of the polytope 
will not increase the number of faces, but can make each vertex d-valent, and 
in general will produce additional vertices and edges out of old multi -valent 
vertices. Thus P may be assumed to be simple. 

This is useful in the 3-dimensional case. 

2n 

Lemma 4 . If d *= 3, A^(3,n) = 1^3"^ " 

Proof . Letting v, e, f denote the number of vertices, edges, and faces 
v-e+f==2 by Euler *s formula. P is simple implies that 2e = 3v. Thus 
V * 2f - 4, By Lemma 2 and 3, A^(3,f) S [(v-2)/3] + 1 = [2f/3] - 1. Easy 

examples show the bound is actually assumed. 
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Theorem . A^^(d,n) = [ - d + 2 when d g 3 or when n ^ d+5, 
except that A|^(4,9) =5. 

The proof is omitted. These give the only specific values of A^(d,n) 
that are known. 

The determination of A^^(d,2d) was a long outstanding problem in linear 
programming, and the conjecture that A(d,2d) = d was "proved” for d S 5 
before it was recently shown to be false even in E^. Thus problems of this 
type are difficult even though they are very easily stated. 

The conjecture of Hirsch was that A(d,n) ^ n - d. This has been shown 
false even in E^. However the bounded version of the conjecture 
A|^(d,n) ^ n-d is still open. A related conjecture that is very easy to state, 
and yet unsolved is: Any two vertices of a polytope can be joined by a path 

which does not "revisit" any facet (i.e., considering a path as a sequence of 
vertices, a path may travel around the vertices of a facet as far as desired, 
but once it leaves a facet it may never revisit the facet later in the 
sequence). If the latter conjecture were true, then the bounded version of 
Hirsch* s conjecture would be true. Actually, the following stronger form of the 



conjecture might be true. In any cell-complex 
any two vertices may be connected by 
a path not revisiting any cell. It is 
not known whether this conjecture is 
true even for 2-dimensional cell complexes 
embedded in E^. i would expect that these 

conjectures are probably false when d gets large, say d ^ 9. 

Details of these and similar results and problems may be found in 




Grunbaum's book. 
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Applications of Helly* s Theorem , 

There are several applications to science of Helly' s theorem or related 
results (see Grunbaum's lectures. Section 5) which could well be brought into a 
classroom discussion on Belly's theorem. One of these has to do with molecular 

genetics . 

In studying DNA molecules which specify the structure of certain phage 
particles, the following investigation procedure was used. The phage particles 
exist both in certain standard forms and in mutant forms. Each mutant arises 
from some blemish in the genetic structure. If the structure of the DNA 
molecule is represented by a graph, we are interested in knowing the structure 
of this graph, and particularly in the part of the graph which contains the 
blemished part which produces the mutant forms. There seemed to be experi- 
mental reasons for assuming that the blemished portion of each graph was a 
sub-arc of the graph rather than just some random subset. Further there was 
a way of telling when two of the blemished sub-arcs overlapped. The research- 
ers then asked whether this observed intersection pattern is consistent with 
the hypothesis that all blemished intervals came from a linear part of the 
graph of the DNA molecule. Or is it necessary to use more complicated 
structures than a line to observe the intersection pattern? 

This leads to the fol3.owing mathematical problem. Determine completely 
what intersection patterns can arise from finite families of intervals on a 
line. Helly 's theorem on the line gives one limitation at once; namely, if 
a finite family of intervals on the line is such that each two have a common 
point then their intersection is non-empty. But this does not apply directly 
here for we are concerned only with pairwise intersections. The intersection 
pattern itself could be thought of as a square matrix of zeros and ones, with 
0 in the (i,j) -place when ^ ^ mutants i and j, and 1 
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in the (i,j) -place if I. fl I. 0. The question then becomes, is this 

i J 

matrix consistent with the assumption that the intervals are chosen from a 
line? 

A more convenient way of describing this is in graph- theoretic language. 
The intersection graph of any finite family of sets is the graph whose nodes 
represent the sets, and two nodes are connected by an edge if and only if the 
corresponding sets have non-empty intersection. For example, the intersection 
graph of the line segments on the line at the left is the graph at the right. 




We will say that a graph is an interval graph if it is the intersection graph 
of a finite number of intervals on the line. The problem is now to character- 
ize all interval graphs in a useful way. We will state only one of the more 
satisfying characterizations of interval graphs. Most parts of the proof of 
this theorem would be understandable and interesting to undergraduates, A 
graph has the rigid circuit property if each circuit is decomposable into 
triangles, that is, each circuit with more than 3 edges has crossings. 

Theorem , 6 is an interval graph if and only if 

(1) it has the rigid circuit property, and 

(2) each three vertices of the graph admit an ordering so that every 
path from vertex 1 to vertex 3 either goes through vertex 2 or is only one 
edge away from vertex 2, 

To finish the story, it was shown that for the experimental results which 
were obtained, the intersection graph of the several hundred blemished 
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intervals in the mk molecules did indeed form an interval graph. Examples 
of this sort point out to students that there are useful and interesting things 
that can be said even about the intersection properties of convex sets on a 

line. 
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An application of barycentric coordinates . 

This application of geometry is of interest in connection with mineral 
engineering. It deals with the phase diagram of a multicomponent situation 
which is isobaric, isothermal, and in which no solid solutions are formed. 
For the sake of illustration, let us consider an example with just three 
primary components, labeled A, B, C. Suppose we are interested in all the 
possible chemical compounds which could be formed from combining different 
amounts of these three primary components and which would be stable at the 
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given pressure and temperature , Each such 
a point in the triangle ABC, where 
the unique barycentric coordinates 
of D with respect to A, B, and C 
determine the relative amounts of 
components A, B» and C that are 
present in D. (Generally it is 
possible only in parts of inorganic 
chemistry to determine a compound in 
terms of the amounts of the elements 
of primary components from which it 
is formed. This cannot be done with 



compound D can be represented as 



C 




carbohydrates, for example, but there are large areas within inorganic chem- 
istry where this rule does apply.) Along with all the vertices in this tri- 
angle which represent the different possible stable compounds, one has a 
decomposition of the triangle into smaller triangles, as shown. Of course, 
for a given set of vertices there are many such decompositions possible, but 
in fact there is only one such decomposition which represents the following 
important aspect of the situation. Suppose we started with a particular 
mixture of the compounds A, B, and C, and the relative amounts of each 



determine the barycentric coordinates of the point K in the obvious way. 

If this mixture is heated to a high temperature, so that all sorts of decom- 
position take place, and then is slowly cooled to the original temperature and 
pressure, the resulting mixture does not consist of the original 3 compounds 
A, B, C. Rather almost all of the resulting mixture will consist of the com- 
pounds which are represented by the vertices of the small triangle AED which 
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contains K. Furthermore, the relative amounts of the compounds A, E, and 
D will be determined by the barycentric coordinates of C in terms of A, E, 

and D. 

It is easy to see the importance of this in certain refinery processes. 
Given a particular type of ore, for example, this type of scheme is useful in 
determining what kinds of compounds can be produced and in what amounts. This 
amounts to knowing the compounds in the ore (A, B, and C) and determining 
which small simplex (triangle) the ore (point K) lies in, and what its bary- 
centric coordinates are. This is rather trivial in the 2-dimensional case 
since you can just draw a picture and see what is going on. In fact, this is 
just what people do. In the higher dimensional cases, barycentric coordinates 
must be computed in the standard ways. The problem mathematically, is the 
following. Let Po»Pi»---»Pd vertices of a d-dimensional simplex 

which contains the smaller simplex whose vertices are 
suppose z € conv(pQ,Pj^» . . . »P^j) . We wish to know whether 

_ / « .i N onH HO what are the affine coordinates of z 

z e conv(qQ,q^, . . .q^) » 

in terms of the q^. 

This sort of problem shows the student that there really are situations 
for which the actual computation of barycentric coordinates is desirable and 
a natural thing to do. The first ideas in computing barycentric coordinates 
are just the usual techniques with quotients of determinants, but there are 
practical refinements of these techniques which soon lead to the attempt to 
compute the coordinates in the most efficient manner. 
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Discussion. 



It vas asked how the decomposition of the triangle into smaller triangles 
was determined from knowing just the 3 original vertices and the locations of 
the interior vertices. Suppose for example, that you knew from the experiment 
that DEFG was a part of the diagram, 
and you wondered which of the two 
diagonals DF or GE should be 
added. The standard technique is 
to form e mixture whose relative 
amotints determine the coordinates 
of the point K, heat it, then 
cool it, and observe whether compounds 
E and G or compounds D and F 
result. Analogous techniques exist for the higher dimensional cases. 

References for this whole topic are: 

P. A. Beck, Journal of Applied Physics 16(1945) 808-815. 

L. A. Dahl, Tn...-nal nf Physical and Colloid Chemistry 52(1948) 698-760. 

V. Klee, Problem ia Barycentric Coordinates, Journal of Applied Physics 

Vol. 36, No. 6(1965) 1854-1856. 

P. M. Pepper, Journal of Applied Physics 20(1949) 754-760. 

Johnson pointed out that another application of barycentric coordinates 
has been in the partitioning of an integer number of legislature seats among 
various political parties, each of which has received a certain percentage of 
the total vote. In this setting the political parties determine the vertices 
of a simplex, and the simplex is subdivided into areas which represent certain 
divisions of the legislature. An article describing this is in Mathematic ^ 
Snapshots by Steinhaua . 
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CONVEX SETS AND THE COMBINATORIAL THEORY OF CONVEX POLYTOPES 

Lectures by Branko Griinbaum 
(Lecture notes by John Reay) 

Convexity is a subject that can easily be taught to undergraduates at the 
junior or senior levels, and there are several good reasons for doing so. 

First, it is possible to reach significant results without the necessity of a 
strong background in other fields. There is no prerequisite of any extensive 
techniques. Secondly, convexity is a tool which may be applied in many other 
fields of mathematics, including number theory, functional analysis, complex 
variables, and others. Also it develops a student *s geometric intuition and 
intuitive comprehension of the proofs and theorems . It introduces combinatorial 
reasoning which has application in many other fields. One of its main advan- 
tages, which is rare in other areas of geometry, is that it is possible to 
introduce students to many open problems early in the course. This makes it 
possible to show that there is much work yet to be done, and problems that have 
challenged our best efforts to find a solution. This type of course can be 
quite a contrast to many courses which would lead the student to believe that 
all mathematics is done and comes in a completed package. Finally, many proofs 
of convexity have arguments in 2- and 3-dimensional Sj_vJce which makes for easy 
understanding, even though the proofs are valid in higher dimensions. 

These notes cover only a small fraction of the directions in which con- 
vexity has developed. In the first part of the notes (Sections 1-9) 3 basic 
introduction is given to convexity and some of its main tools, applications, 
and references. Sample proofs are included, but many of the results are left 
in the form of unproved statements or exercises. These sections are not meant 
to be a complete review of the topics usually covered in a course on convexity, 
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but they are rather meant to supply a foundation for the combinatorial theory 
of convex polytopes, one of the rapidly expanding areas of convexity. This 
second part of the notes (Sections 10-14) is primarily designed to give an 
intuitive grasp of the important results, open problems and basic techniques 



of this area. 
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1, Algebraic prerequisites . 

The following facts, generally known from algebra and easily proved, will be 
used— without special mention— in the sequel. 

If X is a vector space over the reals R, and if A = is a 

n 

sequence of n vectors in X, any expression of the type 71 ^^^ ^i®i’ 

each € R» is a linear combination of the vectors in A. We shall say that 

A is linearly independent provided ^i^i ~ ^ implies “ ••• s= — 0; 

if X.a. =0 is possible with some ^ 0, we shall say that A is 

i=l i i ^ 

linearly dependent . Hence A is linearly dependent if and only if some member 

of A is a linear combination of the other members of A. 

A maximal linearly independent sequence is called a linear basis of X; 

this clearly means that each vector x 6 X is a linear combination of the 

elements of a linear basis of X. Moreover, this linear combination is unique; 

its coefficients are the coordinates of x relative to the given linear basis. 

Any two bases of the same space X have the same cardinality, known as the 

(linear) dimension of X. If X is d-dimensional for some finite cardinal d, 

and if A = (aj^,...,a^) and B = are two linear bases of X, 

there exists a regular linear transformation T of X onto itself such that 

Ta = b for i = l,...,d. Conversely, if A is a linear basis of X and 
i i 

and T is a regular linear transformation of X onto itself, then 
(Ta^,...,Ta^) is a basis for X. Any linearly independent sequence in X 

may be extended to a linear basis for X. 

If X is d-dimensional, a sequence A = (aj^,...,a^) of vectors in X is 

linearly independent if and only if the matrix 
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has rank n, where coordinates of a^ relative to a 

given linear basis of X. 

The linear dimension of the Euclidean d-space is d. A convenient 

linear basis of E^ is the standard basis formed by the vectors 

e = (1»0,...»0), e„ = (0,1,0, ... »0) » e, = (0,...,0,1). 

1 ^ 

If X and X* are two vector spaces of dimension d, and if x^,...,x^ 

form a basis of X, and form a basis of X*, a mapping T from 

X to X* may be defined by . This mapping T is 

an algebraic isomorphism between X and X*; moreover, if both X and X* 
are topological vector spaces, T is a homeomorphism. 

J 

The scalar product <a,b> of vectors a,b € E is the real number defined 



by 



<a,b> = a.p. 



The most important properties of the scalar product are 

<a,b> = <b,a> 

(\a,b^ — \(a,b^ 

(a+b,c) = (a,c> + (b,c) 

(a,a> S 0 with equality if and only if a * 0. 



If (a,b> = 0, then a and b are said to be orthogonal to each other. 
If (a, a) = 1, then a is called a unit vector . In the sequel, the letter 



u 



(with or without subscripts) shall be used only for unit vectors. 



^ 

A hyperplane H in is a set which may be defined as 

H = {x 6 E^ I <x,y) *= oj » suitable y € E , y ^ 0, and o'. An ogen 

halfspace (closed halfspace) is defined as {x € E^ 1 <x,y> > a] (respec- 
tively {x € 1 <x,y> S O'!) for suitable y 6. E^, y ^ 0, and a. Clearly, 

[x e E^ I <x,y> <a] is also an open halfspace for y ^ 0; similarly for 

closed half spaces. 

If X is a real vector space of finite dimension d, a norm || || may be 
defined on X which turns X into a metric space isometric with the Euclidean 
d-space E^. Thus each d-dimensional X is "essentially” the space E^. (The 
norm in X is defined by ||x|| = / (x-x)'. In order to find an isometry between 
X and E^, take any basis Xj^,...,x^ of X and "orthomormalize" it by 
putting y^^ Xj^ and 







<yi>yi> 




foiT k 



Let = y^ / i = Then 

basis of X and. If e is the standard basis of E 

d d 

formation T such that Te^ ” ®i’ ^^^i=l ^i^i^ ” ^i-1 ^i^i 

of the required type ,) 



form a linear 
, the trans- 
is an isometry 



The affine facts listed below may either be derived from their line . ^ 



counterparts, or be proved independently. 

If X is a vector space over the reals, and if A =* (a^^* • • • ® 

sequence of elements of X, any expression of the type where 

all \ € R and jP' X = 1> is an affine combination of the elements of A, 
We shall say that A is affinely Independent provided no member of A is an 



A7 



affine combination of the other members of A; otherwise A is affine^ 



dependent . 

A is affinely dependent if and only if there exist 

not all equal to 0, such that = 0 and = 0. 

^ - (a^,...,a ) is affinely dependent if and only if the sequence 

(a - a ,...,a - a^) is linearly dependent. 

«!• 

If X is of (linear) dimension d, and if the elements a^ of A 
have coordinates linear basis of X, then 

A is affinely independent if and only if the matrix 




has rank n. 

Any maximal, affinely independent sequence in X is an affir^ t>as _ ^ of X 

If A is a linear basis of X then A U {0} is an affine basis of X, 

^ „d , 

The set of all affine combinations of two different points x,y e E is 
the line L(x,y) = [(l-X)x + Xy j X real}. If x',y' ^ 

X* 4 y'» then L(x',y*) - L(x,y). 

If a set H has the property that L(x,y) CH whenever x,y € H, 

X 9 ^ y, we call H a flat , or an affine variety . Clearly, the set of all 

affine combinations formed from all finite srbsets of a given set A is a 

flat; it is denoted by aff A and is called the affine hull of A. The 

family of all flats in contains 0. all one-pointed sets, all 

lines, all hyperplanes; also, it is intersectional: if all H^'s are flats, 

SO is OH. The affine hull aff A of a set A may equivalently be 
Of Of 
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defined as the intersection of all flats which contain A. The formation of 
the affine hull is translation invariant ; i.e., aff(x+A) = x + aff A, (Note 
that aff (A+B) = aff A + aff B is not true in general .) 

d 

Every flat H is a translate H = x + V of some subspace V of E , 
and is therefore isomorphic to the Euclidean space of a certain dimension 
r g d; the dimension of H (and of V) is then r = dim H = dim V. 

Each r-dimensional flat contains r+1 affinely independent points, but 
each (r+2)-membered set of its points is affinely dependent. 

A transformation T of E^ into E is affine provided it preserves 

affine combinations; that is, 

“i^i 

whenever x 6 and ot’s are reals satisfying 

1 ^ 

Images, or Inverse Images, of flats under affine transformations are again 

d X 

flats. In particular, an affine transformation A from E to E = R is 
determined by Ax = <a,x>. If a ^ 0, the inverse image by A of any point 
O' 6 R is a hyperplane in E , denoted by 

H(a,o') =5 {x € E \ Ax = (a,x) = o'] . 

Conversely, each (d-l)-flat in E^ is expressible in this form for suitable 

a and oi and is, consequently, a hyperplane. 

The main advantage of the affine notions over the corresponding linear ones 
is their invariance under translations. Hence, among other advantages, their 
use in proofs avoids "choosing the origin" at some advantageous point. 

The assumption that X is a vector space over the reals is irrelevant 
to many of the definitions and facts listed above. Any field of characteristic 
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0 could be used instead of the real numbers. 



Discussion . 

It was generally felt that the affine notions (affine independence, trans- 
formations, etc.) were usually unfamiliar to the students. It was suggested 
that this could be due to a lack of affine topics in most linear algebra 
courses. Opinions about how much time was necessary to give an adequate founda 
tion in affine topics varied from one or two hours upwards. Klee suggested 
letting the word "blank" stand for any of the words linear, affine, positive, 
or convex, and then develop the general ideas of "blarik independence," "blank 
hulls," etc. as far as possible with comparisons of their differences. See 

Klee [1]. 



2 , Definition and elementary properties of convex sets. 



Let X be any vector (linear) space over the reals, of finite or infinite 
dimension. A set K c X is called convex provided 

(*) whenever x,y € K, then all points of the (straight-line) 
segment determined by x and y belong to K. 

Denoting by R the field of real numbers, condition (*) may be reformulated 
as 

(**) whenever x,y ’€ K and X € R satisfies 0 ^ X ^ 1, 
then Xx + (l-X)y € K. 

Examples of convex sets in any real vector space X: 

(i) the empty set 0; any single point; the whole space X: 

(ii) any (linear) subspace of X and, more generally, any 

(i.e., translate of a subspace) in X. 

(iii) any (closed or open) halfspace of X (i.e., any set of the 

type [x e X 1 cp(x) S o;} or [x € X j cp(x) > o;}, where of € R and tp is a 

non-trivial distributive functional on X; this means that tp is real-valued, 
<p(x) 4 0 for some x € X, and tp(oix + gy) = oftp(x) + ptp(y) whenever 

x,y € X, € R) . 

(iv) if X is a normed space, any closed or open ball in X 
(l.e„ set of the type C* € X 1 l|x-xj| S tr] or {x € X j Hx-xjl < 0 f]. where 
x^ € X, 0? € R, and a > 0) . 

Convex sets are general enough to appear in widely different fields, from 
elementary geometry to functional analysis. On the other hand, they have 
sufficiently much structure to allow significant results. 
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In the sequel we shall mostly assume that X is the d-dimensional 
Euclidean space though many of the results do not require this restriction, 

or may be modified to hold in more general spaces. 

d 

We list now a few fundamental properties of convex sets in E ; additional 

properties will be discussed later. 

1 . The family of all convex sets in E is inter sectional ; that 
means that for any non-empty family {K^} of convex sets in E , the inter- 
section n K is of the same type. All closed convex subsets of E also 

V V 

form an intersectional family, and so do all bounded convex sets, or all com- 
pact convex sets. 

2. Any convex combination of points belonging to a convex set 

K C E*^ belongs to K. This means 

(vv**) whenever n ^ 1, , . . . >0?^^ € R, ^ 0 for i = l,...,n 

and x^^,...,x^ € K implies 

^i=l Vi ^ 

Clearly, condition (**) used in defining convexity is the special case 
n r: 2 of (***). On the other hand, (**) implies (***), as is easily seen 
by induction on n; assume without loss of generality that ^ 
the identity 

^ .-^i-l 01 ^ 

1a= 1 “i*i = Vn + Zi=l T ’‘i’ 

Where M = a , . 

d d ^ 

3. If A is an affine transformation of E into E , and if 

K c E^ and K* c E*^ are convex sets, the AK = {Ax j x € K} is a convex 

T d 

set, and so is the Inverse image by A of K' , A’ K' - {«■ 6 E j Ax 6K'}. 
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Hence, in particular, the convexity of a set K is not affected by any 
translation (x+K) , homothecy (oK) , or orthogonal transformation. This may 
be used in proofs to reduce the more general situation to one in which a suit- 
able point is at the origin. (See, for example, the proof of Theorem 6.10.) 

4. If and are convex sets in then 

Ki + K2 * + y-2 1 \ ^ ^1’ ^2 ^ ® 

Note that we shall use the notation -K « (-l)K * {-x | x € K} 

and + (-K 2 ) . Hence "subtraction" of sets (K^ - K 2 ) is not 

the inverse operation of the (vector, or Minkowski) "addition* of sets 

+ K2) . 

5 . If K c is convex, then its topological closure cl K and 
its interior int K are also convex. Thus if x,y € int K and 

0 ^ X ^ th6n (Xx + € int K« 

Denoting the distance between points x,y €0*^ by |lx-y|l * / <x-y,x-y)> 
the convexity of cl K follows from the observation that if 
jjx^-xjj < 6 and Hy^"yll < 6 then, for 0 ^ X ^ 1> 

11 (XX^ + (l-X)yp - (\x + (1-X)y)ll ^ X||x^-xll + (l-X)lly^-yll < 6. An alternate 
proof of the convexity of cl K follows from 1, 3, and 4 above and the 
observation that cl K » fl^(K + cB) , where B is the unit ball centered at 
the origin. The convexity of int K follows from the fact that there is some 
open d-ball B centered at the origin for which x + B C K and y + B C K, 
Thus if 0 ^ X S 1 and if u € B then 

(Xx + (l-X)y) + n * X(x+u) + (1-X) (y+u) € K 
and therefore (Xk + (l-X)y) + B c K, so int K is convex. 

d 

6, Similarly, if x € int K and y € K where KC E is convex, 
and if 0 < X ^ 1> then Xx + (l-^.)y ^ iat K. 
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As in the last proof, if x + B c K then it follows that 
(XX + (l-X)y) + XB cz K. 

Remarks 

^ . _d , 

1. The intersectionality of the family of all convex sets in E may be 

used to define convex sets, or certain types of convex sets. For example, we 

shall see in Section 3 that the family of all closed convex proper subsets of 

may be characterized as the smallest intersectional family of subsets of 

which contains all closed half spaces . Similarly, the family of all compact 

convex subsets of E^^ is the smallest intersectional family of subsets of 

E^^ containing all closed d-balls . All convex proper subsets of E^^ may be 

obtained analogously by starting with "semi-spaces". (Semi-spaces may be 

explicitly defined in various ways: a particularly appealing definition-- 

though unsuitable if one intends to use semi-spaces to define convex sets-- 

is: a semi-space is any maximal, convex set which does not contain a given 

point .) 

Other instances of this approach to convexity will be discussed later. 

2. Modifications of conditions (**) or (***) are frequently 
encountered in the definitions of various families of sets in Euclidean spaces, 
or in more general vector spaces. A few examples are: 





Type of set 


Together with x and y 
the set also contains 


Whenever 
Ckf,p€R and 


(i) 


(linear) subspace 




Q(x + py 




no other condition 


(ii) 


flat (affine variety) 


Q?x + py 




Of + p = 1 


(ill) 


cone 




Q5C 




0? > 0 


(iv) 


convex cone 




o?x + py 




a ^0, p > 0 


(V) 


convex set 




cyx + py 




Ckf + P = 1» Qf,p ^ 0 


Linear, affine and positive 


combinations 


(generalizing (i) , (ii) , and 



(iv)) are defined similarly to the fashion in which convex combinations were 
defined by (***) . See Klee [l] , Some of the results on convex combinations 
we shall see later have analogues for the other types of combinations (or for 
some of them) . 

A number of other notions related to the above have been studied; such as, 
for example , the mid-point convexity (where with x and y the set contains 
^(x+y)). Some results are known on combinations with arbitrarily prescribed or 
restricted sets of coefficients. See the article by L. Danzer, B. Grunbaum, 
and V. Klee [l3 referred to hereafter as DGK [l3 -- and Motzkin tl3» 

3. It is sometimes convenient to consider projective transforma tion^ of 
convex sets in E^, A transformation P from E to itself is called 
projective provided it is possible to express P in the form 



Px = 



Ax + b 
<c,x>+6 ’ 



% 



d ^ 

where A is a linear transformation of E into itself, b,c € E , 

and at least one of c and 6 is different from 0. 

The transformation P is not defined on the set 



6 € R j 



N(P) = fy € 1 <c,y>+6 = 0} 



o 

ERIC 



55 



The set N(F) may be empty (if c = 0 ^ 6) . in which case P xs an affine 

transformation. If c ^ 0 then N(P) is a hyperplane (which, if A is 

regular, is mapped by P onto the "hyperplane at infinity" of the £ro j ect lve 

d. 

d-space containing E ) . 

If K is a set in E** and P a projective transformation as above, P 
is said to be permissible for K provided K 0 K(P) = <t . Generalizing 
property 3 we have: 

If K c e'* is a convex set and if P is a projective transformation 

permissible for K, then PK is convex, 

4, The notion of convexity has been modified in numerous ways to make it 

suitable for settings different from vector spaces over the reals. In 
particular see DGK [1], Section 9 especially. We shall mention here a few 
of these variants; it is rather instructive to investigate the results we shall 
encounter with respect to the modifications needed in their proofs to make 

them valid for the different "convexities.” 

(i) If Q is any ordered field, convexity in any vector space over Q 

may be defined by the analogue of (**) , This notion is useful in clarifying 
the role of completeness, closure, or compactness assumptions in various 
theorems . A particularly interesting special case is that in which finite- 
dimensional spaces over the field of rational numbers are considered. 

(li) Experience has shown that the most useful definition of convexity in 

the d-dimensional projective space P is: 

A subset KC p"* is convex provided there exists a (d- 1 ) -dimensional 
subspace of H of p"* such that K 0 H = 0, and if for each straight line 
L of P** the set K H L is either empty or connected. 
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Note that this setting is not interesting if we are dealing with a single 

convex set In P** since a projective transformation carries the hyperplane H 

onto the hyperplane at Infinity and the set K onto a set equivalent to a set 

in Note that property 1 does not hold for projective convexity, 

(iii) A number of notions of spherical convexity have been found useful . 

(See p, 157 of DGK [1] for four different definitions and further references.) 

The one we shall have occasion to mention later is the following. Let S 

denote the unit d-sphere in that is, S = [x 6 E ] (x,x) - 1}, A 

d+1 

set K c will be called convex provided the set cone^K c E is convex, 
where cone^K denotes the union of all closed rays (half-lines) originating at 
the origin 0 € and passing through a point of K. 



Exercises 

1, Prove in detail the assertions made above. 

2, Prove the convexity of the (closed) unit d-ball 

= [x € e'^ I 11x11 S 1] c: E^ 

3, Prove the convexity of every d-simplex, 

4, Show that each closed, midpoint convex subset of E^ is convex, 

5, Let {K^} be a family of convex sets in E^, Show that if every 

denumerable subfamily of {K^} has a non-empty intersection, then 

n ,K, 0. 

V V d 

6, Let K*, K" c: be called '’equivalent” provided there exist x € E 

and X > 0 such that K” = x + XK* . Prove that this is an equivalence 

relation, and that there are eleven distinct equivalence classes of convex 
subsets of E^ = R. What is the cardinal of the distinct classes of convex 
2 

subsets of E ? 




« 
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7. Let have the origin 0 as center of symmetry (i.e., K= -K) , 

and assume that K is compact and that int K 4 0. Show that the function 

d 

cp defined for x 6 E by 

?p(x) = inf{o? § 0 I X € Q?K} 

is a norm on E^. (That is 

(i) (p(x) 5 0 for all x 6 , with cp(x) = 0 if and only if x = 0 ; 

^ _d 

(ii) cp(x+y) S cp(x) + cp(y) for all x,y t E 

(iii) cp(Mc) = lajcpCx) for all x € s'* and all o; 6 R.) 

Equivalently, show that p(x,y) = tp(x-y) is a metric on E . 

8. If K is a compact convex subset of and B is the (topological) 

boundary of K, and if C = + h \ x-I « B} , then K = C. The 

example of a closed halfspace shows that "compact" may not be weakened 

to •'closed." 

t 



Discussion. 

Johnson pointed out that one of the various definitions of spherical 
convexity is equivalent to convexity in and raised the question of 

defining convexity in vector spaces over finite fields . This appears to 

be a difficult problem. 

Klee suggested the following proof of a slightly stronger version of 6; 
If q e cl K, P € Int K, and K is convex, then u = Xq + (X-I)P € int K 

for 0 S X < 
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As before, assume B is an open ball at the origin of radius 0 for 
which pH B c K. Consider the open ball B* centered at q of radius 




X . e, that is, the ball which is the reflection through the point u 

l-\ 

q 6 cl K implies that there is some x € K H B' . Then u lies in the 
relative interior of the line segment determined by x and some y € p 
The proof then proceeds as in 6. 
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Then 



B. 



3, Separation and support . 

Let K denote a convex subset of E , d ^ 0. We shall first prove; 

1. int K 55 if and only if K contains d+1 affinely independent 



points . 

In other words, the interior of K is non-empty if and only if the affine 

d 

hull aff K of K is the whole space E , 

Indeed, if a maximal affinely independent set of points of K contains d 
or fewer points, their affine hull contains K but has dimension d-1 or 
less--hence int K = 55. On the other hand, if K contains affinely independent 
points Xq,...,x^, then K contains the open simplex 

l4o Vi I 4o "o > ° 



hence int K 0. 

For AC E*^ let rel int A and rel bd A denote, respectively, the 
interior and the boundary of A considered as a subset of the flat aff A 

Ir 

(which is isometric to some E ) , From 1 there follows 

2, rel int K 0 whenever K ^ 0. 

Indeed, the following stronger result holds; 

3. int K c rel int K = rel int (cl K) c K c cl(rel int <K) * cl K. 

The proof of 3 uses 2 (or 1) and the technique used in the proof of 2,5 
(i.e,, result 5 in Section 2 ), 

Let A jA^ c: E*^; we shall say that A^ and A^ are se£arai^ ^ a, 
hvperplane H(u,X) = {x 6 ] <u.x> = X} (where u 6 E is a unit vector, 

and \ € R) provided A- is contained in one and in the other of the 

closed halfspaces H** == H^(u,X) = {x ^ E^ ] (u,x> S X] and h" = H“(u,X) = 

{x € E^ I (u,x) g X}. Note that a set may be separated from itself, for 
example, a line segment in E^, We shall say that non-empty sets A^ and k^ 
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are strictly separated by H(u,X) provided Aj^ is contained In one and 
in the other of the open halfspaces int H and int H . 

It is immediate that 

4. If H = H(u,\) separates and A^, then H separates cl A^ 

and cl A , and it strictly separates int A^^ and int A^. 

Cm 

The fundamental lemma on strict separation is 

5. If K and K« are compact convex sets In E such that 

• 1 z 

K n K = a, there exists a hyperplane which strictly separates and 

X 2 

Proof . Let o = inf tHy2-yill 1 yi « h’ ^2 ^ ° 8(^1 .K2). Since 

K and K, are compact and disjoint, o>0 end there exist 

X 2 

and X, € K, such that Hx^-x^H = c. Then, if denotes the hyperplane 

HCx^-x^, and if = HCx^-x^, convexity of 

K. implies that c c int and ^ int . 
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Hence, for any X such that 0 < X < 1, the hyper plane 
HCx^-K^, (X^^i + ( 1 -X)k 2 , V^2» separates and 
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then 



Indeed, if some point z of were in the interior of 

clearly some point on the line segment [z,Xj^3 would be in by convexity, 

and be closer to x^ than x^. Note that this argument is 2-dimensional, even 
though the proof is valid in . Also note that this is the only point where 
the convexity hypothesis enters. The only use made of compactness was to obtain 

the existence of x^^ and x^. 

Regarding the separation we have 

6. If K and K„ are bounded convex sets in such that 

1 ^ 

aff (K U K«) ^ then K. and K„ may be separated by a hyper plane if 

' 1 2 ^ 

and only if 

rel int Pi rel int = 0 . 

The "only if" assertion is an easy exercise; we shall prove here only the 

"if" part of the theorem, assuming without loss of generality (see 3 and 4) 

that K and K„ are closed (hence compact) . For 0 < c <1, let 
1 2 

Kj^(e) = Xq + (l-e)(-XQ + and = Fq + (1-®) ("Fq + V’ 

XQ^relintK^ and y^e rel int are fixed. Then 



rel int = Uq ^ g < K^(e) ^or i = 1,2, 

K.(e*) c: rel int K. (e) for 0 < e < e' < 1 and i » 1>2, 

Since K (e) and K„(c) are compact and disjoint, 5 implies the existence 
of a hyperplane H(e) = H(u(e) , 1(e)) which strictly separates K^(e) and 
K„(e). Since each H(e) intersectc the segment [x^.y^], the set 
{\(e) 1 0 < e < 1] is bounded. Together with the compactness of the unit 
sphere 8*^“^ which contains the set {u(e) \ 0 < e < 1}, this implies the 
existence of a sequence (Sj^» . . . >6^^* • • •) » ii“ * 0, for which the following 
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limits exist X = llxa \(e„) and u = Urn u(e^) . Ihen the hyperplane 
H(u,l) is easily seen to separate rel int Kj^ and rel int K2. hence also 
and K^. 

An alternate proof of 5 can he given based upon the following assertions 
If C, is a non-empty convex proper subset of and C2 = E ~ Cj^ is 

also convex, then cl C. is a (closed) half space and bd Cj^ = bd C2 - a 
hyperplane. (See Hanmer [1].) Assume K, and K2 are disjoint convex sets 
in E^ and x is any point of e'*. We use the following Lemma: Either 
K, n conv({x} U K2) = or else K2 0 conv({x} U K,) - where conv A 
denotes the smallest convex set which contains A. (See Section 4.) 

well-order all points in the space, and with this Lemma add x^ to one 
of the two sets to obtain disjoint convex sets K21. one of which 

contains x^. and for which c la a similar fashion expand these two 

sets to two new disjoint convex sets Kj^2> ^ZZ’ which con..ai ^ 



By transfinite induction, we obtain two disjoint convex sets K| and K* 

whose union is e"* and for which CK*. By the first assertion these sets 
are separated by the hyperplane which forms their common boundary. Thus it 
suffices to establish the lemma. Its denial asserts that for some point x. 
n conv(tx} U K2) 0 and K2 H conv({x] thus there must be 

points and a point x £ E^ and points 

n rel int conv 

y« € K„ 0 rel int conv {x,k^l. 
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€ K 



1 



This two-dimensional situation clearly leads to the existence of a point 

z € K n K„, a contradiction. Thus 5 holds. 

X ^ 

Both of 5 and 6 may be strengthened by weakening the hypotheses, but the 

basic ideas of these proofs already appear in the above formulations. In 5, 

for example, we may demand that only one of the closed sets be compact. 

Another application of the compactness argument (relative to u(e) and \(g)) 

allows one to strike "bounded" from 6 and to establish 

d ^ 

7 If K and K are convex sets in E such that aff U ^2) ~ E , 
1 2 



then 



K and K„ may be separated by some hyper plane if and only if 
X ^ 



rel int fl rel int = 

As simple consequences of the above we have 

8. Each closed convex set K in is the intersection of all the 

closed (or of all the open) half spaces of E^ which contain K. Each open 

d 

convex set K in E° is the intersection of all the open halfspaces of E 
which contain K. 

9. If K is a convex set in E^ and if C is a convex set such that 
C <= bd K, then there exists a hyperplane which separates K and C. 

* * * * 

Let Ac E^, let x € be a non-zero vector, and let X € R. The 
hyperplane H(x,X) is called a supporting hyperpla^ of A provided either 
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X = sup[<x,a> 1 a e A} or \ = inft<x,a> 1 a € A}; in the first case we 

shall say that H(x,X) is the supporting hyperplane of A in direction x 

(or with outward normal x) > and we shall denote it by H(A,x) . 

From 9 we have: 

10. If Kce"* is convex, and if C c bd K is convex (in particular, 

if C is a single point of bd K) , there exists a supporting hyperplane H 

of K such that C C H, 

The following independent proof of 10 can also be given when C is the 
single point p. Let B be a closed ball with center p and let y be a 
point in B which is farthest from the set K. Now y ^ cl K since p must 




be in bd K, so there is a point z € cl K which is closest to y. Let H 
be the hyper plane normal to [z,y] through z. As in previous arguments, 

y and K are separated by H, so if z = p, then H must be the desired 

supporting hyper plane and we are done. If z ^ p then the hyper plane H* 
parallel to H and through y must meet the interior of B. Thus some point 

u € B must be farther from K than y, a contradiction. (See Botts [l].) 

A partial converse of 10 is 

11 If K c is compact and convex, then each supporting hyper plane 
H(y»cy) contains at least one point x such that (x,y) - Q?« 
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Note that compactness cannot be weakened to "closed" in 11; e.g.» let 

K = [(a,b) € 1 a > 0, b > 1/a} and H = [(a,b) | b = 0} . 

* * * * 

There are two types of separation problems implicit in the above. We may 
ask how strong a separation is possible between two convex sets with given 
properties. Also we may assume certain conditions on one of the sets and 
require a certain type of separation, and ask what properties are necessary 
for the second set. In general, the stronger the given conditions on the first 
set, the weaker are the necessary conditions on the second set. See Klee 
for theorems of this type. 

It should also be noted that the separation of a flat and a convex set may 
be interpreted in a natural way as the Hahn- Banach theorem. 



Exercises . 

1. Show that for any convex K c E we have rel bd K = 0 if snd only if 
K = aff K. Is the assumption that K be convex necessary? 

2 . Prove 2 and 3 , 

3. If K is not convex, is it possible that all the sets mentioned in 3 be 
different? Generalize, by taking repeated closures and relative interiors, 

4. Prove that and K 2 may be [strictly] separated by a hyperplane if and 

only if it is possible to [strictly] separate 0 and - K^. 

5. Show that 5 remains valid if the assumptions on the convex sets 
K- ,K« c E^ are weakened to either 

(1) 6(Kj^.K2) > 0; 

or (11) Is bounded and cl 0 cl = 0. 



1 
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6. prove 7, by considering intersections of and K2 with a sequence of 

balls with radii tending to infinity, and using 6. 

7. Find examples in which K c is a closed convex set and H is a 

^ r V vf n K - d Show that for every bounded 

supporting hyperplane of K, but H n K - V). anow uu 

set A and every supporting hyperplane H of A we have H 0 cl A ^ , 

8. Prove the following converse of 10. (Compare the figure.) If A is a 

closed subset of with Int A # 0 , and if for each w € bd A there 

exists a supporting hyperplane H of A such that w e H, then A is 



convex 




x,y € A, z € A, v € int A, w € bd A 



9. Characterise those convex subsets of e" which have no supporting hyper- 
planes . 

10. prove that each compact convex set in E** is the Intersection of closed 
balls • 
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11 If K and K are disjoint compact convex setSy then the set of all 
® 1 2 

pairs (ujOf) such that HCuja) strictly separates and is an 

d*"X 

open subset of S * X R. 

12, Let A denote a non-empty subset of E . The supporting function 
h(x,A) of A is defined for all x € E*^ by 

h(x,A) = sup[<y,x> I y €A}. 

If for some non- zero x € E*^ we have h(x,A) < “ then H(A;x) - 
{z € E*^ 1 <z,x> = h(x,A)] is a supporting hyperplane of A in direction 

X. 

Prove; 

(i) The supporting function h(x,A) is positively homogeneous and 
convex, that is it satisfies 

h(A?c,A) = Xh(x,A) 

and h(x+y,A) g h(x,A) + h(y,A) 

for all \ § 0 and x,y € E*^. 

(ii) If A ^ 0, \ S 0, and x,y € E*^ then 

h(x,y+A) = (x,y) + h(x,A) 
h(x,XA) =• Xh(x,A) 
h(x,cl A) = h(x,A). 

d 

(iii) If A- and A„ are non-empty and if 0 ^ x € E , then 

X ^ 

H(A^ + A^; X) = H(A^; X) + HCA^; X) 

(A^ + A^) n H(Aj^ + A^; x) *= (A^ n H(Aj^; x)) fl (A 2 H HCA^; ?*)). 
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(iv) If h(x) is any postively homogeneous and convex function 
on such that h(0) = 0, there exists a unique non-empty closed 

convex set K such that 

d 

h(x) = h(x,K) for all x € E . 

13. Find meaningful variants of the notions and results of this section 
relevant to spherically, or projectively, convex sets. 

14. What becomes of the results of the present section if one considers con 

vexity in the rational d-space? 



Discussion . 

Heusner suggested that a better name for "relative interior would be 
"absolute interior," since it depends only on the given set and not the dimen- 
sion of the space in which it is embedded. Perhaps it should be treated as 
the core of the set. Apparently the only justification for the use of the 
word "relative" is that the relative interior is just the usual interior in 
the affine subspace determined by the set when this space is given the relative 

topology. 

Klee pointed out that the transflnite Induction argument used in the 

second proof of 5 could be modified to a usual induction argument (thus being 

more palatable to undergraduates) by considering a countable dense subset of 

d 

rather than well-ordering all the points of E . 

In a discussion of infinite-dimensional separation theorems it was observed 
that the algebraic and topological structures are not as closely related as they 
are in the finite dimensional case, and a part of the problem is that hyper- 
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planes correspond to continuous linear functionals. The separation theorems 
are still valid in Infinite dimensional space provided suitable hypotheses ate 
added— Indeed, they are major tools in the theory but it is not true that two 
disjoint convex sets may he separated, as the following example shows: Let X 

be the (non-complete) vector space of all sequences x = (Xj^,X 2 ,,.,) of real 
numbers, all but a finite number of which are zero. Let c X be the subset 
of all sequences whose last non-zero number is positive. Let K 2 = -Kj^ U {0} • 

Then K and K, are convex disjoint subsets of X and U K 2 - X, Yet 
1 ^ 

each K. is topologically dense in X, Moreover, if 
1 

X = »0 » • • •) ^ 

(i,e,, \> 0 ) and y = (0,, , .0,-l,0,0, , , ,) with -1 in the (n+l)th place, 

then the halfray {x + ty ] t > 0} c K 2 has x £ as endpoint, A similar 
statement is true for each x 6 K 2 , When presenting this example Klee gave the 
opinion that infinite-dimensional examples and results should he used to give a 
course broader scope whenever this may be easily done with only a little 

extra time, 

Killgrove and Stratopoulos observed that a certain amount of algebra, 
topology and analysis seems to be a prerequisite for any undergraduate course 
using this material. After considerable discussion on the level at which a 
college geometry course should be taught and the amount of topology to be in- 
cluded, Kelly remarked that we are aiming here for an ideal course to be taught 
at good universities by a good staff. The universities have an important in- 
fluence on the small colleges and courses designed for high school teachers. 

It is important that geometry courses should keep the respect of the professK- 
and appear as an alive subject related to the rest of modern mathematics , since 
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if it is thus accepted at the good schools , it will filter down to the weaker 
schools in perhaps a weaker form, hut without the feeling that geometry is a 
dead subject that can well be left out of a: crowded curriculum. 
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4, Convex hulls . 

The space is convex [and closed], and the intersection of any family 

of convex [and closed] sets is again convex [and closed]. Therefore the 
following definitions make sense: 

The convex hull conv A of a subset A of E is the intersection of 
all the convex sets in E^ which contain A, The closed convex hull 
cl conv A of A c is the intersection of all the closed convex subsets 
of E^ which contain A, 

Clearly, if A is bounded, so are conv A and cl conv A, 

An immediate consequence of the definitions is 

1 , For every A cz we have cl (conv A) = cl conv A , 

Proposition 8 from the pieceding section implies 

2. cl conv A is the intersection of all the closed half spaces 

which contain A. 

A useful representation of conv A is given by 

d 

3 , The convex hull conv A of a non-empty set A cz E is the 

set of all points which may be represented as convex combinations of points 

of A; that is, points which can be written in the form °^i^i’ 

x 0^, — 0 , S. - O', = lj n=l,2,,,. , 

i i i*=i i 

In many applications the following result is very important. 

4, If A is a compact subset of E*^ then conv A is closed; 
in other words, for compact A we have cl conv A - conv A, 

Using the results of the preceding section it is not hard to give a 
direct proof of 4, by induction on the dimension d. Since a much simpler 
proof results from Caratheodory's theorem, we defer the proof of 4 till we 

establish 5. 
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The following theorem is one on the basic results in convexity, and has 

important application in other fields . 

5. (Oarathlodory's theorem) If A is a subset of then each 

X € conv A is expressible in the form x = ^ ’ 

“i S 0, = 1. 

Proof ■ Let X € conv A be given; let x = (with x^ € A. 

a =0. a. = 1) b® a representation of x as a convex combination of 

points of A. Involving the smallest possible number p+1 of points of A. 
We shall prove Carathlodory's theorem by showing p S d. Indeed, assuming 
p g d + 1 it follows that the set (Xp,....Xp) is affinely dependent. Thus 
there exist 0 S i s p, not all equal to 0, such that = 0 

and S?_ p =0. Without loss of generality we choose the notation so that 

x^O i 



For 



R >0 and for all those i (0 S i S p-1) for which P^^ > 0 

0 S i S P-1. l®t Yi = “i - pj Pi- ^i-0 Vi = ^i»0 “i ■ p; "=i"0 Pi 

Moreover, S 0; indeed, if Pj^ S 0 then S S 0, if Pj^ > 0 

Vi = ^i=o ° ^ 

is a representation of x as a convex combination of less than p+1 points 
of A, contradicting the assumed minimality of p. This completes the proof 

of Caratheodory*s theorem. 

The proof of 4 is now immediate. Indeed, if x € cl conv A, there 
exists a sequence x^ £ conv A such that x - Urn By Carathe'odory's 

theorem each x^ - ^'n.i ^ ° ^ ^“.1 ^ ^ 

each n. The compactness of [0.1] and of A guarantees the existence 

of converging subsequences (^Uj^.i^ ^”k’^ 



( 1 ) 
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and lim X , = Then obviously 0 S S 1 



» ^i=0 ^ 



=1, € A, 



fc -*00 \! 

and X = as claimed. 

A result closely related to Caratheodory * s , in the sense that eith 

easily derivable from the other, is Radon* s theorem: 

6. If A is a (d+2)-pointed subset of it is possible to find 

disjoint subsets A*. A** of A such that conv A* 0 oonv A** ^ 0, 

A direct proof of Radon's theorem is very easy. Let A = (Xg . . . • .J^a+l) • 
Since d+2 points in d-space are affinely dependent there exist 0!^. not 
alleiual 0. such that = 0 and = 0. Without loss of 

generality ve choose the notation so that are positive and 



, • • • 



>01 



d+1 



non-positive. Then 0 ^ p ^ d. Let oi - ^_q ^ ^ 

01 . 



P+1 - - - 

define 3^ = ^ for 0 . i . P and v, = 4 ^ ^ 

If fine dependence of A can be rewritten in the form S|„g Pj^x^ = Yj.*!' 

„ vP R = V =1. this relation expresses 

Since S 0, S 0, and isp+1 ^i 

y \ n X ^ 5^ 0* as claimed by Radon's theorem, 

conv(xQ,...»Xp) n conv(Xp^j^,...,Xj^j^; f v> 



Exercises , ^ 

1. Show that a hyperplane H c supports a set AcE if and only 

if H supports conv A, 

2. Proposition 4 states that the convex hull of a compact set is compact; 
show that the convex hull of an open set is open. The convex hull of a closed 
set is not necessarily closed; find a closed set A 0 such that conv A 

is an open proper subset of the whole space, 

3 . For ACE^ let T(A) - C|(Xj^+X2) 1 X1.X2 € A}; let T(A)»T(A), 
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and for nSl let (A) = X (t" (A) ) . Denote T*(A) = Uj,gj^T“(A) . Show 

that cl T*(A) = cl conv A, although in general X*(A) ^ conv A, If A = bd K 
where K is a bounded convex set in E , d £ 2, show that cl K T( )• 

4. For A c E*^ let e(A) = + (l-Dx^ 1 ^ ° ^ ’ 

Define e^A) = 9(A) and e“+^A) = 9(9“(A)) for n S 1. Show that 

^ d 

conv A = U , 9“(A) . Characterize those convex sets K c E for which 



K = 0(bd K) . See Bonnice - Klee [1], Section 1. 

5. Prove Steinitz's theorem: If x £ int conv A c e'^. there exists 

a subset A* of A, containing at most 2d points such that 
X 6 int conv A*. Show that the number 2d may not be decreased in general, 
and characterize those A and x for which 2d points are needed in A. 

6, Let ACE*’ be a finite set. Then x € rel int conv A if and 
only if X is representable as a convex combination of all po. s of A, 
With all coefficients positive . 



7. Show that in Radon's Theorem 6 the sets A* and A** are unique 

if and only if every d+1 points of A are affinely independent. Show also 
that in this case two points of A belong to the same set if and only 
they are separated by the hyperplane determined by tlie remaining d points. 

8. What becomes of the above results if convex sets in the d-sphere, 
the projective d-space, or the rational d-space are considered? 

What are the analogues of the above results if the convex combinations 
are replaced by other types of combinations? (See Section 2, remark 2.) 
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Discussion . 

Klee presented the following alternate proof of 4: Let 

B^+l = 1^) 6 1 X. S 0, E = 1} ’'hioh is clearly compact 



Let X denote the compact set 



X A X ... X A . 
d+1 times 



d a^-»7^ Xa.By Caratheodory's 

Map X into E by ((Aq>...AP> ^o’*“’^d^ i=0 ^i i 

theorem, the image of X is exactly conv A, and the map is clearly con- 
tinuous . The continuous image conv A of the compact set X must be 

compact . 

Grlinbaum remarked that the original proof of Carathlodory used induction 
on the dimension d. This had the disadvantage of needing to know results 
about the intersection of c<xnpact sets with their supporting hyperplanes, while 

the present proof is more elementary. 

Klee pointed out that in Carathe'odory 's theorem, if P € conv A and 

x6A, then p6B for some set BcA of at most d+1 points, one of 

^ u V mav be thus specified in advance. A 

which is X. Only one such point x may p 

slight modification of the given proof gets this stronger result. 

A similar generalization was reported recently by Motzkin [2] . 

carathlodory- s theorem has had a large number of uses in other fields as well 

as convexity; for example, use of Carathe'odory 's theorem and the separation 

theorems have significantly simplified proofs of some results about doubly 

Stochastic matrices. 

Johnson and Griinbaum mentioned that if the given set of d+2 p 
Radon's theorem is not degenerate, specifically if the points are in general 
position, then there is a unique way to divide them into two disjoint subsets 
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whose convex hulls meet, 

Reay mentioned that If a set Ac e“ of (d+1) + (k+1) points 
(0 s k S d) is in general position, then it may be divided into two disjoint 



subsets whose convex hulls meet in a k-dlmenslonal set. More generally, if 
A has (r-l)(d+l) + (k+1) points, then it may be divided into r disjoint 
subsets whose convex hulls meet in a k-dlmensional set. However, when r > 2, 
then an Independence stronger than general position is necessary for the set 



A. See Reay [1], 
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5. Helly*s Theorem . 

The present section is devoted to a result known as Helly's theorem. It 
has many applications to different fields, and it occupies a central position 

among combinatorial-geometric problems. 

1. (Helly) Let K = [K^,...,K^} be a finite family of convex sets in 
E^, d = 1. If for each subfamily K* of K, consisting of at most d+1 
sets, there exists a point common to all members of then there exists 

a point common to all sets of K. 

It is easy to give examples which show that the convexity of each set 

K is necessary. If K c: is the closed ray [n,oo) for each integer 

i n 

all hypotheses hold except the finiteness of K, yst the conclusion 
fails. So K must be finite. (See 2 below.) 

We shall give two proofs of Helly's theorem; each of them exhibits 
certain features which reappear in the proofs of many combinatorial-geometric 



results. Note that no generality is lost in assuming n ^ d + 2. 

The first proof is due to Helly himself. It consists of two technically 

distinct steps . 

(i) It is enough to prove the theorem under the additional assumption 

that each set is compact. Indeed for each (d+1) -tuple ^ 

[l,...,n} let x(iQ,...,ip € n^_Q . Let , for j = l,...,n, be 

defined by X^ = -^x(iQ, . . . ,i^) j j € [i^ , . . . ji^} and let = conv X^ . 

Since X is a finite set, each C. is compact; also, X c: K implies 
j J J J 

C CK . Moreover, the intersection of any d+1 sets C, , . . . ,C is non- 

j j Jo ^d 

empty, since it contains x(j^ , . . . , j ^) . But if Helly's theorem is assumed 
for families of compact convex sets, we have 
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nU \ ^ '^1=1 °i ^ 

hence, the validity of Kelly's theorem for general convex sets. Note that the 

finiteness of K is necessary in this part of the proof. 

(li) In order to prove Kelly’s theorem for families K consisting of 

compact convex sets we first note its validity for d = 1. Indeed, it is 
i^edlate that each K. (which is, for d = 1, a closed segment, possibly 
degenerating to a point) contains the rightmost of the left endpoints of the 
members of K. Assume now that the theorem is false; among the families for 
which it falls, choose one for which d is as small as possible; then 
d > 1. Among all the families in that E** for which the theorem does not 
hold, select a family K with smallest possible cardinality n. Then 
n > d + 1, and for each subfamily of K containing n-1 members, the inter- 
section of its members is non-empty. Let K = n“lj ^ 0 ; then K 0 K„ = 0 
and by Theorem 3.5 there exists a hyperplane H strictly separating K and 

K^. In particular, K 0 K = 0 . Let fl K for i = l n-1. Then 

each C. is a compact convex subset of the (d-1) -dimensional space K; 

1 j i\ j. 1 QP^q c ,C have a non-empty intersection; 

moreover, each d = (d-1) + 1 sets » • • • 

JL 

pd^^ = K n ^i.) because has points in as well 

as in K. Kence, by the minimality assumption on K and the choice of H 
we have the contradiction 0 0 fl^lj = K fl (n^lj = K fl K = 0 . This 
completes the first proof of Helly's theorem. 

The second proof of Kelly's theorem goes back to Radon. We began by 
establishing its validity in case n = d H- 2. Simplifying the notation used 
in (i) above, let, for i = 1, 2 , . . . , n = d + 2, 
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By Radon*s Theorem 4.6 there exist sets ^ [l>...»n3 such that 

n = 0» Ji u J 2 = 

conv {x^ I i € J^} n conv {x^ j i € J 2 } 7 ^ 0 • 

Let X be any point of this intersection; then x € for each j € J 2 
since each such K. contains all x. with i 6 and hence K. contains 

also their convex hull. Reversing the roles of and we see that 

X G K. also for all j G i.e., x G 7^ CJ. 

Hence Kelly' s theorem is valid for all families in E consisting of 

n S d+2 sets. Now we apply induction on n. From a family K = 

with n > d+2, we form a family C' = • • • >^n-l^ ’ where 0 

for i=l,...,n-l. The intersection of each d+1 members of C' is non-empty 
since it coincides with the intersection of some d+2 members of K, which 
is non-empty by the above special case. But the inductive assumption then 
implies 0 + n"“]; Kj , and the proof is completed . 

Using the standard compactness arguments, it is easy to derive from 1 

the following infinite version of Kelly' s theorem. 

2 . If K is any family of compact convex subsets of E such that 
every d+1 or fewer members of K have a non-empty intersection, then the 

intersection of all members of K is non-empty. 

An extensive list of results related to Kelly’s theorem, with references 
to the original papers , may be found in the Danzer-Grunbaum-Klee survey 
listed in the bibliography. The following exercises are meant only to give 
the reader the opportunity to try his hand at applying either one of the 
above versions of Kelly’s theorem, or the techniques used in their proofs. 
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Exercises . 



1, Theorem 2 remains valid if the convex sets are assumed to be 

closed, provided some finite subfamily has a bounded intersection, (Even 

the "finite" of the preceding sentence may be omitted, but then some additional 

facts — e,g.. Theorem 7,3 — have to be used,) 

2, Let K be a family of compact, convex subsets of E such that, 
for some fixed k with 1 § k S d+1, each k or fewer members of K have 
a common point , Then for every flat L c of dimension d + 1 - k there 
exists a flat L' of the same dimension, parallel to L which Intersects all 

the members of K. 

3 , Let K be a family of compact convex sets in E and let C c E 
be a compact convex set. If for every d+1 members of K there exists a 
translate of C intersecting all of them, then there exists a translate of 
C which intersects all the members of K, A valid statement results also 
if "intersecting” is replaced throughout by "containing," or by "contained 

in." 

A special case of this result was first established by Vincensini [l] . 

The following simple proof is due to Klee [2]. 

For each € K, let = {x € E*’ 1 (x + C) 0 ^ 0 ] , Each la 

convex, and each d+1 or fewer of the have a point in common. By 

Kelly's theorem, there is a point x common to all C^, so (x + C) 0 f 0 

for each K. € K. 

^ d 

4. Let A and B be two finite sets of points in E . Then A and 

B may be strictly separated by a hyperplane if and only if for each 

X c A U B such that X contains at most d H- 2 points, there exists a 

hyperplane strictly separating X fl A from X fl B. 
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4 

5, Let 3 ^ he a finite family of parallel line segments in E , 
three of which admit a common line transversal (i.e., a line which intersects 
each of them) . Then there is a line transversal common to all members of 
Assume with no loss of generality that has at least three members 
and all segments are parallel to the Y-axis. For each segment L € let 
denote the set of all. points (a.b) 6 such that L is intersected by the 
line y = ax + b. Each is convex and each three such sets have a common 

point, so by Kelly's theorem there is a point (ao-bg) ^ 
v = a X + b^ is a connnon transversal. 

Note that this may be extended in several ways. Using the first exercise 
above, if all the line segments are closed, then the finiteness restriction of 
£ may be relaxed. Instead of using transversal lines, we may use transversal 
polynomials of degree n, provided we demand that each (n+2) of the given 
segments have a common n-polynomial as transversal. 

Also see Rademacher and Schoenberg [1] for other applications of Kelly* s 

theorem. 

The general idea of applying Kelly's theorem is to prove something for 
small sets or fewer elements) of certain objects in order to conclude 

that it is true for the set of all the objects. It has frequently happened 



that the collection of objects considered in a particular problem was too 
small to be suitable for an inductive proof. For example, if we wished 

2 

to prove Kelly* s theorem for the family of all elliptical domains in E , 
any attempt to work with the foci or other geometric properties of the 
ellipse would probably end in a disaster. 
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Discussion . 

Klee mentioned that a negative form of Helly*s theorem is frequently 
very useful > particularly in analysis: If K is a family of convex sets 

(with suitable restrictions) with empty intersection, then some small sub- 
family must already have empty intersection. 

One other application of Helly*s theorem is the following (see DGK [1] 
Theorem 2,5); Let X be an infinite compact set in (a d-dimensional 

art gallery) , and suppose that for each d+1 points of X there is a point 
X 6 X from which each of these d+1 points are visible (i.e,, [xjU^^^J c X) , 
Then X is starshaped (i.e,, there is some x 6 X from which each point of 
the art gallery is visible) . 

This problem leads directly to many unsolved "illumination" problems, 

A famous one mentioned by Klee was: Given a triangular billard table, is 

there some point and some direction so that a (one-pointed) billard ball will 
describe a path which is dense in the triangle? Is it true that from all 
points there exists a direction which gives such a path? For general tri- 
angles almost all such meaningful questions of this type are unsolved. 

It was pointed out that various theorems similar to Exercise 5 are 
known where the segments of that exercise are replaced by various other kinds 
of sets. These results lead many (including Kelly himself) to look for a 
version of Kelly* s theorem that was valid in a much more general setting. 

One such generalization was given for non-convex sets and is called Kelly's 
topological theorem. 

It was noted that Grunbaum has generalized Kelly's theorem in the follow 
ing way (see Grunbaum • Let K be a finite family of convex sets xn 
E*^, and suppose 0 gk g J. If each h(k) or fewer sets of K have an 
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intersection which is at least k- dimensional, thee 0 M is k- dimensional, 
where 

h(0) « d + 1 

h(k) «2d + l- k (ISk^ d-1) 

h(d) St d + 1 

Note that this is just Kelly' s theorem if k « 0 and Vincensini's theorem 
if k = d. 
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6, Extreme and exposed points ; faces arid poonen^. 

Let K be a convex subset of A point x € K is an Ktreme £oint 

of K provided y.z 6 K, 0 < \ < 1, and x = \y + (l-\)z Imply x = 7 = a- 

In other words, x is an extreme point of K if it does not belong to the 

relative interior of any segment contained in K. The set of all extreme points 
of K is denoted by ext K, Clearly, if x 6 ext K then x i conv(K ~ {x}). 
Let K be a convex subset of e"^. A set F c K is a face of K if 

either F = 0 or F = K, or if there exists a supporting hyperplane H of K 
such that F = KnH, 0 and K are called the improEer faces of K. The 
set of all faces of K is denoted by 3i(K) . A point x € K is an racj 20 S^ 
point of K if the set [x] , consisting of the single point x, is a face 
of K, The set of all exposed points of K is denoted by exp K. If K is 
a closed convex set, it is obvious that each F € 3!(K) is closed. The nota- 
tions ext K, exp K and 3(K) will in the sequel be used onji for closed 
convex sets K. Note that the definition of exposed point involves the 
"outside of K," i,e,, supporting hyperplanes, while the definition of extreme 

point involves the "inside of K," i.e., segments in K. 

The following statements result at once from the definitions: 

1, If F e »(K) and if K' c K is a closed convex set, then 

F n K' € 3i(K’) , 

2. If F € S(K) and if x € F, then x € ext K if and only if 
X e ext F; thus, if F € 3(K) , then ext F - F H ext K, 

e 2 

3 • For every convex K E we have exp K cn ext K • If K cz E is the 

m 

convex hull of a circle and an exterior point 
Xj then the point y of tangency is an ex- 
treme point, but not an exposed point, because 
the only face of K containing y is the 
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segment [x,y]. Note that z is an exposed point with a unique supporting 
hyperplane while x admits many supporting hyperplanes. 

4. Let K be a closed convex set in let x € K, and let B be 

a solid ball centered at x. Then x € ext K if and only if x € ext(K fl B) , 

while X € exp K if and only if x € exp(K H B) . 

The next two results explain the role of the extreme points . 

d 

5. Let K be a compact convex subset of E . Then K = conv(ext K) . 

Moreover, if K = conv A then A Z) ext K, 

Proof. Clearly, K 3 conv(ext K) . In order to establish KC conv(ext K) , 

we use induction on the dimension of the convex set K, the assertion being 

obvious in case dim K is -1, 0, or 1. Without loss of generality we assume 

= aff K. Let x € K. If x ext K, let L be a line such that 

X € rel int (L H K) . Then L H K is a segment [y,z], where obviously 

y,z € bd K. Since through each boundary point of the convex set K there 

passes a supporting hyperplane, there exist faces and of K con 

taining y and z respectively. Now, the dimensions of F^ and F^ are 

smaller than dim K; by the inductive assumption, F^ = conv(ext F^) and 

F = conv(ext F ). Using statement 2 (above) we have x € conv(y,z) c 
z z 

conv (F U F ) c conv (conv (ext F ) U conv (ext F^)) c conv (ext F^ U ext F^) c 
y Z J 

conv(ext K) , as claimed. The last assertion of the theorem being obvious, this 

completes the proof of 5. 

An analogous inductive proof yields also 

6. Let K be a closed convex subset of E*^, which contains no line. 

Then ext K 0. 

Regarding exposed points, we have 

7, Let K c: E^ be a compact set and let H ~ {x € E j (x,u) > O'} 
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(where u is a unit vector) be an open half space such that H'*’ H K «( . 

Then h'' fl exp K # (J. 

Proof . Let K* = H*" 0 K, let y £ K*. and denote by e the distance 

from y to bd H*' and by 8 the number 6 = max[p (x,y-eu) 1 x € K 0 bd H ) . 

8 ^ + 

Let z = y - pu, where p is some fixed number satisfying P ^ 2e 

Denoting by B the solid unit ball centered at the origin, let 
p, = infix >0 1 z + XB => K*), Clearly, p S p. Then by the compactness of 
cl K*. we have z + p,B 3 K* and C = (cl K*) 0 bd(z + p,B) 0. We claim that 

•-j- 

C n bd H*" = 0. Indeed, assuming the existence of a point v 6 C D bd H , we 
would have 8 ^ S (pCv.y-eu))^ = . (g.e )2 3 2 pe - / which implies 

2pe S 8 ^ + in contradiction to the choice of p. Therefore, C C K* 

but clearly each point of C is an exposed point of z + p,B and therefore 

also of cl K* end of K, as claimed. 

Lemma 7. together with 4, 5, and 6 above. 8 from Section 3. and 4 from 

Section 4, imply Straszewicz' theorem: 

8 , If K c is a compact convex set then cl conv exp K - K, 

Indeed, let K' = cl conv exp K; obviously K' c K. If K' ^ K, then 
there exists an x € K such that x K* . Since the compact convex sets 

[ k ] and K* may be strictly separated, there exists an open half space 
such that H+ n K 0. But then H+ 0 exp K 0 by Theorem 7, contradicting 

the definition of K' . 

The reader is invited to prove 

9. If KC is a closed convex set, then exp K c ext K c cl exp K; 
therefore, if K contains no line, then exp K 0 0, [Hint: K closed, 

bounded and convex in e'^ implies K is compact , (Note that this may not be 
true in more general settings.) Hence K = cl conv exp K c cl conv cl exp K - 



o 
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conv cl exp K = K. Thus by the second part of Theorem 5, ext K c: cl exp K,] 

To get an example in of a set K for which exp K is not closed, 

consider conv (C U S) where C is a circle and S is a closed line segment, 
not in the plane of C, whose relative interior meets C. 

Regarding the family I?(K) of all faces of a closed convex set K we 

have; 

10. The intersection F = of any family [fJ of faces of a 

closed convex set K is itself a face of K, 

Proof. If F = 0 the assertion is true according to our definitions; 

thus we shall consider only the case F ^ 0. Without loss of generality, we 

may assume that the origin 0 belongs to F, and that each F^ is a proper 

face of K. Then the face F. is given by Fj^ = K fl {x ] <x,Uj^> - 0} where 

u is some unit vector such that K ct {x | {x,u. ) SO}. Let 
i 

H = [x I <x,v> = 0} where v = then clearly K c {x j <x,v> ^ 0}. 

Since 0 € K n H, this implies that H is a supporting hyperplane of K. 

Now, if X e F, then <x,up = 0 for all i and therefore <x,v> = 0; 

hence x € H 0 K and thus F ci H H K. On the other hand, if x e K - F, 

then <x,u.) > 0 for at least one j and <x,v> ^ <x,u^> >0; thus 

X 0 H n K. Therefore, F = H fl K and F is a face of K, as claimed. 

The family [F^] in Theorem 10 may be infinite; in this case the face 
of smallest dimension obtainable as an intersection of finite subfamilies of 

{F.} equals the intersection of all members of {F^}* 

It is easy to find examples which show that the following situation is 

possible; K is a compact convex set, C G 3^(K) and F G 5(C), but F 0 5(K) 

Consider the example in 3 above; F = [y] and C = [x,y]. 

This observation leads to the following definition; 
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A set F is called a poonem* * of the closed convex set K provided there 

exist sets F^ such that F„ = F. F^ = K. and F..^ € 5(F,) for 

i = l,,,.,k. Alternatively, a set F is a poonem of K provided 
p =s K n sff P K P is convex. 



By this definition, each face of K is also a poonem of K but the 
converse is not true in general. However, each (d-1) -dimensional poonem of 
KCe"* is a face (facet) of K, so the facets and (d-l)-poonems coincide. 
Clearly, each poonem F is a closed convex set, and ext F = F 0 ext K, 

Thus each extreme point of K is a poonem and conversely, each 0-dimensional 
poonem is an extreme point. One of the characteristics of poonems that makes 
them a valuable tool is Just that fact, namely, the poonems are related to 
extreme points analogously to the way faces are related to exposed points of 



K. 



The set of all poonems of 
The reader is invited to 



a closed convex set K 
deduce from Theorem 10 



shall be denoted by P(K) 
the analogous result! 






11. The 



intersection F = n, F^ of a family £f^} of poonems of a closed 



convex set K is ia iP(K.) • 

12. If F € P(K), then P(F) = [P € P(K) \ P c F} . 

13. If F € SCK) and P € P(K) , then P H F € P(F) and P PI F € 3f(P) . . 



Exercises . 

1, A convex cone has at most one exposed point, 

2, Let K denote a compact convex set. Show that if dim K S 2, then 

ext K is closed, but exp K is not necessarily closed. Find a K <= such 

* "Poonem" is derived from the Hebrew word for face. 
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that exp K 5^ ext K ^ cl exp K. 

3. If the family 3f(K) of all faces of a closed convex set K is 

partially ordered by inclusion) then is a complete lattice. (For 

lattice- theoretic notions see, for example, the books of Birkhoff or Szasz^) 

The same is true for the family IP(K) of all poonems of K, (In both cases 
the greatest lower bound of a family of elements is their intersection^) 

4. If K is a closed convex set and if C is a subset of K, show 
that C € IP(K) is equivalent to each of the following conditions: 

(i) C is convex and for every pair x,y of points of K either the 
closed segment [x,y] is contained in C, or else the open interval (x,y) 
does not meet C. 

(ii) C = K n aff C and K~ aff C is convex. 

(iii) There is a flat L for which C = K fl L and K ~ L is convex. 

(iv) There exists an x G K such that C is the maximal convex subset 

of K satisfying x G rel int C. 

5. If F. G ^(K) for 0 ^ i = n and if F c U?_ F. , then there exists 

i . 1 ^ i ^ n, such that c: F. . The same is true if all F belong 

0 0 " ^ ^0 

to iP(K) . 

6. Let K and K« be closed convex sets. Prove: 

(i) If F^ G 3^(Kp for i = 1,2, then F^ fl F^ G 3(K^ fl K^) . 

(ii) If F^ G POO for i = 1,2, then F^ fl F^ G P(K^ (1 K^) . 

(iii) If F G IP(K^ n K^) there exist F^ G P{K^) and F^ G P{K^) 
such that F = F^ n F^. 

(iv) If rel int fl rel int 0 and if F G 3(K^^ fl Kp j there 

exist G 3(Kj^) and F^ G ^(K^) such that F = F^^ fl F^. 



90 



(v) Find examples showing that (iv) is not true if 
rel int fl rel int K 2 = 0 . 

7. Let T be a non-singular projective transformation. 



_ Ax + b 

^ " <c,x> + 6 ’ 

and let H*" be the open halfspace = {x € E | (c,x) + 6 > 0} . Prove: 

(i) If A is any subset of then T(conv A) = conv TA. 

(ii) For every compact convex set K for which T is permissible, 
g(TK) = [TF 1 F € 3J(K)} and P(TK) = {TF ] F € P(K)}. 



Discussion . 

Yale pointed out that the length of the chain definition 

of poonem can be of length at most d+1 if F^ = K ci E , On the other hand, 

in each space there are examples of extreme points Cx} = Fq in a set 

K = F for which the length of each such chain is exactly d+1. The example 
k 

2 

in 3 above shows this in E . 

Klee remarked that contrary to what might be supposed, the relation 
between extreme points and exposed points in the boundary can be quite complex. 
For example, in there exist sets which have the following properties: 

15 the non-extreme boundary points are dense in the boundary, 

2) the extreme but not exposed points are dense in the boundary, and 

3) the exposed points are dense in the boundary. 

For a number of years it was thought that in some reasonable sense almost all 

extreme points should be exposed. In the 2-dimensional example in 3 above, 

2 

there are only two extreme points which are not exposed. And in E it is 
easy to see that there can be only countably many non-exposed extreme points. 
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since each one would have to be the end of some line segment contained in the 
boundary of the convex body. Both Klee and Cheque t tried to prove a higher 
dimensional version of this theorem and failed. A few years ago Corson [l] 
showed a 3-dimensional example of a convex body in which almost all (in a 
sense that can be made precise) of the extreme points fail to be exposed. 

For open problems in this area, see Choquet-Corson-Klee [l]. 
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7. Unbounded convex sets . 

The present section deals with some properties of unbounded convex sets . 
1, A closed convex set K ci is unbounded if and only if K contains 

a ray. 

Proof. We shall consider only the non-trivial part of the assertion. 

Let Xq 6 K, and let S = bd B denote the unit sphere of centered at 

the origin. For each \ > 0 we consider the radial projection 

P^ = tt(K n (Xq + \S)) of the compact set K fl (Xq + XS) onto x^ + S, the 

point Xq serving as center of projection.* Since radial projection is 

obviously a homeomorphism between x^ + XS and x^ + S, the set P^ is 

compact. If K is unbounded, then " 2 ^^ for every X > 0. Since K is 

convex and x^ G K, we have P c P whenever [j, ^ X. Therefore, 

n P ^ 0. If y point of this intersection, the ray 

X^O X ® 

{XyQ + (I-X)Xq I X S 0} is clearly contained in K. This completes the 
proof of 1 . 

2. Let K c be closed and convex, let L = [Xz | X § O] be a ray 
emanating from the origin, and let x,y G K. Then x + L c K if and only if 

y + L c K. 

Proof. Let x + L c K, and let y + Xz G y + L be given, X S 0. 

For 0 < [Jb < 1, consider the point v^ == (l-iJ.)y + |j.(x + — ) € K. Since 
6(v ,y + Xz) = 6(0,iJb(x-y)), the distance between y + Xz and v^ is 

|Jb 

arbitrarily small provided [l > 0 is sufficiently small. But K is closed. 



* If X € the radial projection rr, with center of projection x^, 

0 ^ 

of {Xq] onto the unit sphere x^+S is defined by tt(x+Xq) = x^ + *pj| 
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and therefore v € K implies y + Xz € K, Since x and y play equivalent 
roles, the proof of 2 is completed, 

A closed convex set C c E is a cone with apQX Xq provided Xq + C 
is a cone with apex 0 . A cone C with apex x^ is pointe d provided 
Xq € ext C. Let C be a cone with apex 0. The following assertions are 

easily verified: 

(i) The apices of C form a linear subspace C fl (-C) of E . There- 
fore, either C is pointed, or there exists a line, all points of which are 

apices of C. 

(ii) C = C + C = XC for every X > 0. 

Conversely, if a non-empty closed set C c has property (ii) , then 
C is a cone with apex 0. 

The intersection of any family of cones with common apex x^ is a cone 
with apex x . Therefore it is possible to define the cone with apex x^ 
spanned by a set Ac as the intersection of all cones in E which 

have apex Xq and contain A. Though this notion is rather important in 
various investigations, we shall be more interested in another construction 

of cones from convex sets. 

Let K be a convex set and let x € K, We define 

cc K = fy 1 X + Xy € K for all X § O] , 

Clearly, cc K is a convex cone which has the origin as an apex. Lemma 2 
implies that for closed K we have cc^K = cc^K for all x,y € K, Thus the 
subscript X is unnecessary and may be omitted. The convex cone ccK is 
called the characteristic cone of K, Using Lemmas 1 and 2, we obtain the 

following result: 
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3, If K c is a closed convex set, then ccK is a closed convex 

cone; moreover, ccK 4 [O} if and only if K is unbounded, 

A closed convex set K shall be called line-free provided no (straight) 
line is contained in K. Using this terminology. Theorem 6.6 and the last 
part of Theorem 6,9 may be formulated as: If K is line-free, then 

ext K exp K. It is also clear that every line-free cone is pointed. 

Returning to Lemma 2, we note that it immediately implies: If L is a 
linear subspace of such that x + L c K for some x, then y + L c K 

for every y € K. Therefore, the following decomposition theorem results: 

4, If K c E^ is a closed convex set, there exists a unique linear 
subspace L c E^ of maximal dimension such that a translate of L is con- 
tained in K. Moreover, denoting by L* any linear subspace of E*^ comple- 
mentary to L, we have K = L + (K fl L*) , where K fl L* is a line-free set. 

Some information on the structure of line-free sets is given in the 
following theorem. 

5, Let KC E^ be an unbounded, line-free, closed convex set. Then 
K = P + ccK, where P is the union of all bounded poonems of K. 

Proof . We use induction on the dimension of K, the assertion being 
obvious if dim K * 1. If dim K > 1 and if x 6 K, let y 6 rel bd K and 
z 6 ccK be such that x ® y + z. (Since K is line-free, such a choice is 
possible; indeed, for any t 6 ccK, t ^ 0, there exists a \ > 0 such that 
X - \t 6 rel bd K,> Let F be any proper face of K such that y 6 F. If 
F is bounded then F c P and x € P + ccK. If F is not bounded, the 
inductive assumption and dim F < dim K imply that y « v + w, where 
w € ccF and v belongs to P * > the union of the bounded poonems of F . 
Since P' c P, ccF c ccK, and ccK is convex, it follows that 
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xay + 25 = v+ w+ z6P* + ccK + ccK c: P + ccK. Since obviously 
K 3 P + ccK, this completes the proof of Theorem 5 . 

Since for each bounded poonem F of K, we have ext F = F fl ext K and 

F » conv ext F, Theorem 5 implies; 

6, Let KC E be a line-free, closed convex set. Then 

K s ccK + conv ext K. 



Exercises . 

1, Show that Lemma 1 is valid even without the assumption that K is 
closed, 

2, If K is any convex set in E , show that x,y € rel int K implies 

cc K « cc K, Moreover, for x € rel int K the characteristic cone cc K 
X y 

is closed. 

3, Show that the decomposition Theorem 4 holds also if K is a rela- 
tively open convex set, 

4, Let K C E^ be a closed convex set; then ccK is the maximal 

(with respect to inclusion) subset T c E*^ with the property; For every 
X € K, X + T c K. 

5, Let K c E^ be a closed convex set; then ccK = "j^x 6 E | (x,u) S 0 

for all such u that there exists an o? with K c {z ] (x,u> 

6, Let K c E be a closed convex set such that 0 € rel int K, 

Prove that 

ccK ■* n” - (^) . 

n«l n 

7, Using the notation of the decomposition Theorem 4, let L** denote 
another linear subspace of E complementary to L, Show that L^^ fl K 
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is an affine image of L* fl K. 

8. If K c: is a line-free, closed, convex set, then there exists a 

hyperplane H such that H fl K is compact and dim K « 1 + dim(H fl K) . 

9. If K c: is a closed pointed cone with apex Xq> there exists 

a hyperplane H such that H PI K is compact and K is the cone with apex 

Xq spanned by H D K. 

10. Prove the following results converse to Theorems 5 and 6, 

(i) If K is an unbounded, line-free, closed convex set and if 

K = C + P, where C is a cone with apex 0, then P contains all bounded 

poonems of K. 

(ii) If K is a line-free, closed convex set and if K = C + P, where 
C is a cone with apex 0 and P ^ 0 is a closed, bounded, convex set, then 

C = ccK and P =5 conv(ext K) . 

11. A convex set K is called reducible provided K = conv rel bd K. 
Prove the following results; 

(i) If K is a closed convex set then K is the convex hull of the 

union of all irreducible members of P(K) . 

(ii) Each irreducible closed convex set is either a flat or a closed 

half-flat. 

12. Show that each d-dimensional closed convex set is homeomorphic 

with one of the following d+2 sets: (i) a closed half space of E ; 

(ii) the product X for some k with 0 ^ k ^ d, where B 

denotes the k-dimensional (solid) unit ball. 
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Discussion, 



1 



Since Sections 6, 7, and 8 were presented in the same lecture, many of the 
comments at the end of each section apply elsewhere as well. 

There is no standard way of defining the ’Vertices” of an arbitrary 
closed convex set. If each exposed point is a ' ertex, then each boundary 
point of the circle would be a vertex. One definition could be: a vertex of 

K c is an exposed point which admits d independent supporting hyper- 
planes, Karlin and Shapley [l] have a discussion of degrees of exposure 
of points, A vertex to a set at a boundary point could be defined as a point 
which is the intersection of all supporting hyperplanes at the point , 

Klaa remarked that for a certain class of Banach algebras with unit, the 
unit itself is a vertex of the unit ball of the Banach algebra, and many use- 
ful results can be derived from this fact. See Bohnenblust and Karlin 

There are also simple examples in 2-dimensional rational space of closed 
convex sets which are linearly bounded but unbounded. This type of example 
can give the student a good feeling for the type of complication which in the 
real case arises only in the infinite dimensional spaces , This example also 
shows why "compact” is needed in many of these theorems rather than just 
"closed and bounded,” In the rational plane consider all points between the 
"lines” through (-1,0) and (1,0) with a slope Since all lines are 





"rational," this unbounded set is linearly bounded. In this set we can get a 
sequence of decreasing nested closed non-empty sets whose intersection is 

empty. 

Hausner pointed out that each d-dimensional set must have at least d+1 
exposed points . 

Koehler and Grunbaum pointed out that the best generalization of polytopes 
to countably-faceted bodies was the notion of quasi-polytopes, where the inter- 
section with any polytope is a polytope. Demanding that P be the countable 
intersection of closed halfspaces is not a strong restriction, since each 
closed convex set K c: is such a set. A wide variety of sets is also 

obtained by allowing finite intersections of either open or closed half spaces. 

Johnson and Prenowitz said it would be nice to get at the geometric con- 
tent of these theorems. Most topological concepts, like compactness, can be 
expressed in geometric terms. In the rational example given above, a ray from 
an interior point does not have to intersect the boundary because of lack of 
completeness. Perhaps a geometric condition like "Any ray from an interior 
point meets the set of boundary points" should be imposed. Klee mentioned that 
in any finite-dimensional vector space over any ordered field, a line-free, 
segmentally-closed convex set is the convex hull of its extreme points together 
with its extreme rays. The more general question here appears to be "How far 
can you go with elementary geometrical concepts before you must impose further 

topological properties?" 



99 



8. Polyhedral sets . 

A set K c is called a polyhedral s« provided K Is the inter- 

_d 

section of a finite family of closed halfspaces in E . 

Polyhedral sets have many properties which are not shared by all closed 

convex sets. One of the most important of these properties is: 

1. Each poonem of a polyhedral set K is a face of K. 

Before proving Theorem 1, we note a few facts about polyhedral sets. 

Let Ht = lx e s'* I <x,u.) SQf.], 1 S i § ", be halfspaces, and let 
K = Ht. Without loss of generality we shall in the present section 

assume that dim K = d; we shall also say that a maximal proper face of K 
is a facet of K. The family {H+ 1 1 S 1 S "} calUd irredunda jit pro- 
vided 7^ K for each i = 1 , 2, ..., n. 

Denoting = bd n't = {x € E** ) <x,Uj^> = a] , we have 

2. If K = n“_, Ht, where {n't 1 1 S j S n} is irredundant, then 

J”* J 

F. = H. n K is a facet of K. 

This follows at once from the observation that 0 int (J which, 

in turn, is a reformulation of the irredundancy assumption. The same assump 

tion also implies 

3 bd K = 1 F.j where F. = H. fl K are the facets of K. 

In particular, for each proper face F of K there exists a facet 



F of K such that F C F . . 
i ^ 

Let F = H. n K be a facet of K. Then 
i 1 



/ V + 

F - H n fn = n (h- n np . 

F - H n 1 J 



Thus F. is a polyhedral set, namely the intersection of the sets H H ^ > 

1 ^ j ^ n, each of which is either H. or a halfspace of the (d-1) dimen- 
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sional space H. . Therefore, by Theorem 3, each facet F of is of the 



a suitable j. Thus: 

4. A facet of a facet of a polyhedral set K is the intersection of two 
facets of K. 

Now we are ready for the proof of the following result which, in view of 

Theorem 6.10, clearly implies Theorem 1. 

5. Every poonem F of a polyhedral set K is an intersection of facets 

of K. 

Proof . We shall use induction on dim K, the assertion being obvious 
if dim K = 1. If dim K > 1, let x G rel int F. By Theorem 3, there 
exists a facet F^ of K such that x G F^, i.e., Fez F^. Theorem 6.12 
then implies that F is a poonem of F^. Using the inductive assumption we 
see that F is an intersection of facets of F^. Since each facet of F^ 
is the intersection of two facets of K, this completes the proof of Theorem 

5. 

We mention also the following immediate consequence of Theorem 5. 

6. If K is a polyhedral set, then the family 3?(K) is finite. 




Exercises . 



1. (See Theorem 7.4 for the notation.) Show that if K is a poly- 



hedral set 



then 



K = [x G 1 <x,u^> ^ Qfj,} 

L = [x G E^ I <x,u^> = 0}. 




] 
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2. Show that if K is as above, then 

cc K = 1 <x,u^> § 0}. 

3. Show that if K is as above, and if p € K satisfies <p,u^> = 
for 1 ^ i ^ m, and <PjU^) > for m < i ^ n, then 

cone K = n , , {x ^ E j (xji? .) ^ 

p i=i 1 ^ 

4. Show that every affine map of a polyhedral set is a polyhedral set, 

and that K, + K. is a polyhedral set provided and K 2 are polyhedral. 

JL ^ 

Find a polyhedral set K and a projective transformation T (permissible 
for K) such that TK is not a polyhedral set. 



9. 
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Part II 



The material of the second week of these lectures is a departure into a 
more specialized area than that of last week. The present material would not 
be suitable for a year-long course in undergraduate geometry, but it would be 
appropriate for a number of one-quarter courses or honors papers at the senior 
level. In the second part of these lectures I will try to present more ideas 
and results, and therefore fewer proofs than last week. Much of this material 
has relatively easy proofs provided the correct sequence of proving these 
results is found. In general, since we work with finite systems, less back- 
ground in analysis and topology is required. A student can get by with only 
an algebra background. 

Unfortunately, the terminology in this area has not been standardized. 

Throughout this part of the notes the (perhaps unbounded) intersection of a 

finite number of closed halfspaces will be called a polyhedral A d-pql^- 

d 

tope will always denote a bounded d-dimensional polyhedral set in E . 

These definitions, as well as most of the other notions, results and 
proofs of Part II are presented m detail in the book Grunbaum []l] . 
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10 . Number of facets of polytopes . 

As mentioned before, the timing problem for computer programs has led to 
the general question "Given a poly tope in with f facets, how many 

vertices may it have?" Until about five years ago the answer was unknown 
except for certain easy cases. Thanks to some new ideas by Klee [2] in 1962 
the problem is now almost solved. The little residue of the problem that is 

Still left, however, is very challenging and strange. 

Rather than attack the problem as stated, we will use the dual procedure, 

namely, find the maximum number of facets of a poly tope given its dimension 
and the number of its -vertices. These are really equivalent problems because 
of the existence of duality among polytopes. Here d = dimension of the poly- 
tope P, V = number of its vertices, p,(v,d) = maximum number of facets of a 
polytope in with v vertices , To avoid trivial cases (analogous to many 

problems with the empty set) let us from now on assume d S 2, This keeps a 
result like "p,(v,d) is a strictly increasing function of v" from being 
false or ill-defined. We also let f^(P) = the number of k-faces of P, 

-1 S k S d, and if P is understood we write f^^. The fxrst result is 

immediate , 

1. li(v,2) = V 

2, In E^ f 2 ^ n(fo,3) = 2fg - 4, 

For a proof of 2, we just combine the following facts. 

( 1 ) fo - fi + f^ = 2 (Euler's equation) 

(2) Assume without loss of generality that P is a simplieia l. polytope (i,e,, 
all its facets are (d-1) -simplices = triangles) 

(3) counting the incidences of edges and facets of simplicial 3 -polytopes 
in two different ways, 2f^ = Sf^. 



o 



From (1) and (3), £2 = 2£q - 4 - ia(£q,3) . 

(4) Note that £or each £q = 4 we can get examples which show that these 
upper bounds are assumed. 

In higher dimensions we try to go through these same steps but in general 
their proofs are harder, and together they might not always get the result. 

Further they £requently lead to unsolved problems. 

We first turn to a discussion of Euler's formula, the analogue of step 

(1) above . 

d i 

3. For each d-polytope, ® * 

Since £ ^ ^ d-polytopes, the £ormula frequently is 

given the form (-l)"f . = 1 - 

Schiafli as early as 1852 but the first published proofs of it came in the 
early 1880 's when a number of independent proofs were published, all of which 
were incorrect (or rather incomplete) in their reasoning. (See GrCnbaum [1], 
Chapter 8.) The basic idea in E^ of each of these proofs was to start with 

one facet (and its associated edges and vertices) and note that 

f f + f = 1 holds. This formula remains valid if a second adjoining 
"^0 ■ 1 2 

facet is added. These authors assumed that it is possible to always suc- 
cessively add facets of P (and their faces) in such a fashion that the 
newly added facet meets the already present 2-complex ((d-1) -complex) in 
a simply connected polygonal line segment ((d-2) -complex) . This continues, 

preserving the formula fg " ^1 ^2 “ ^ ^i=0 ~ 

the last facet which must then be added individually, thus giving Euler's 

formula. The critical assumption of these proofs is certainly not obvious, 

even for d = 3. In case d = 3 tne assumption may be established by an 

argument using the Jordan curve theorem. Pot d & 4 the problem is still 
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open and in fact, certain examples cast doubts on the validity of the assump- 
tion. There are simplicial complexes in for which the assumption is 

false, though none of these is known to correspond to a polytope. While 
Poincare seems to have given the first valid proof in 1899, the old proof 
continued to appear in books (e.g., Sommerville [2]). Most modern proofs 
used heavy topological machinery, and it was only in 1955 (Hadwiger [1]) 
and 1963 (Klee [l]) that elementary proofs were given, and even these had 
algebraic overtones. The following proof is completely elementary and stays 
Within the framework of convex polyfopes: 



(1) Euler's equation obviously 
holds for d = 1 and d = 2, 
thus starting the induction 

on d . 

(2) It holds if P is a 
prismoid , i.e., the convex 
hull of two polytopes 

and P^, for which 




(aff Pp) n (aff P 2 ) n P = 0 . 

To see this , let P^ be the intersection of a hyperplane H with 
int P , where H is a hyperplane from the pencil of hyperplanes determined 

by affP^ and aff P^- Then Pq (P) = f q *0 

1 g k S d-1, a k-face of P is either a face of Pq or P 2 

it has vertices in both P^ and P 2 , in which case it corresponds to a 

(k-l)-face of Pj^. Thus fj^(P) = ^k^2^ ^k-1^1^ 



1 g d-1. Summing over k we get 

(-i)\(p) = + ^k<^2» 




(-i)’'’^'-fk(^i> • 
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By the induction applied to P^> we see that Euler's formula holds: 

^ k =0 = 2 - (1 - = 1 - (- 1 )^. 

(3) Euler's formula is similarly easy to deduce for any d-simplex, or any 
d-pyramid, i.e., the convex hull of a (d-1) -polytope and a point, 

(4) Let P be any d-polytope, and let H be any hyper plane which meets 

int P and which contains 

exactly one vertex x of 

P. Let P"*" = h"*" n P and 

p“ = h” n P. If Euler's 

formula holds for the 

+ 

d-polytopes P and P 
and for the (d-1) -polytope 

p = ?■*■ n p" = H n p, 

then it is valid for 





P = P'*' U P‘ . 

The proof is analogous to that of part (2) above when we notice that 
the following formulas hold: 

fjCP) = fiff'*') + ^ 

and if 2 ^ k ^ d-2, 

fk(P) = + fk<^‘> - - *k-l<^0>- 

(5) If P is any poly tope in E“, there is a hyperplane H such that 
no translate of H contains two or more vertices of P. 

(6) Let P C be given, let be parallel hyper planes each of 

which meets P in exactly one vertex, and let * * * *^v-l parts 

of P into which the divide P. Note that P^ and P^^^^ are 



i 

t 

i 

! 
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d-i>yramids, while P2’"”^v-2 are prlsmolds . By (2) and (3) Euler's 
formula holds for each P.. By repeated use of (4), Euler's formula holds 

for P = P • This proves the theorem. 

i=l i 

We next turn to the reduction assumption in part (2) of Theorem 2 
above . 

4. In determining the maximal possible number of facets of a d- 
polytope P with v vertices, we may assume without loss of generality that 
P is a simplicial polytope . 

An argument that comes to mind at once to establish this result is of 
the following type; If there is a facet F which is not a simplex, then F 
admits a (d-2) -diagonal G. Including in the set of intersiictlng half spaces 
which determine P a halfspace which is close to F"*" but rotated slightly 
about G we will "slice off" a piece of P. For example, if F is the top 
face of a cube, ther 





If the amount "cut off" is small, so that the 
same, then the number of facets is increased, 
gives a simplicial polytope and 4 is true. 
However, the following example shows that 
it is not obvious that this sort of thing 
can be done without an increase in the 
number of vertices . 



number of vertices remains the 
and repeated such operations 
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Another (and successful) «ay to approach the proof of 4 is by the use of 
the following theorem (Eggleston-Grunbaum-Klee [l]) . 

5 . For each d-polytope P, there exists an e> 0 such that if 
p' is another d-polytope with p(P,P') < «> then 

The Hausdorff metric p is defined by 

p(P,P’) = inf{a > 0 1 P c (P'HoiB) and P' <= (P-wB)} 

where B is the unit ball at the origin. 

This theorem gives us the reduction to simplicial polytopes as follows . 

If P is not simplicial, then there are too many vertices on one facet. 

It is then possible to move one of the vertices away from P by an amount 
sufficiently small to assure that all other vertices remain vertices . This 
"pulling out" operation does not increase the number of vertices. The com- 
binatorial structure might have changed quite a lot, but we are interested 
in only f^_j^(P'), and Theorem 5 implies 

the distinguished vertex moves by less than e(P). If f still has non 
simplicial facets we continue this "pulling out" process, at each step 
changing each successive polytope by a very small amount (with respect to 
the Hausdorff metric) , keeping the number of vertices the same, and increasing 
(non-strictly) the number of facets until we obtain a simplicial polytope. 

This establishes Theorem 4. 

It is well-known that if p(P,P') is small, and if \ denotes the 
k-dlmenslonal content of the k-faces, then f^j(P) and Yj(P') are close, 
and Y , (P) and Y^_^(P') are close. Easy examples show that in general 
the (d-2) -dimensional contents need not behave similarly: in E let 

be a cube and P' the similar cube with one of its edges "sliced off." 
However, it may be proved that Y^(P) is a lower semicontinuous function of 
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P for all k, and that Y (P) may be extended to the family of all compact 

iC 

convex sets under preservation of this property (allowing value +°°) 
(Eggleston-Griinbaum-Klee [l]) . This observation leads to a wealth of isoperi- 
metric type problems concerned with d-polytopes such as the following. For a 
g,’ven d-polytope P whose k-faces have k-dimensional content 
what can be said about Y. (P) for i ^ k, 1 ^ i g d? In particular, no 
one has pioved that the convex set P in with largest volume, for which 

=s 1, must be a poly tope and not have an infinite number of vertices, 

nor is the maximal volume known. 

We now turn to a generalization of the third step of the proof of 

3 

Theorem 2. We noted that for simplicial polytopes in E , 2f^ = Sf^* 

The generalization again counts certain objects in two different ways. For 
0 ^ijj = d, and for a d-polytope K, let g. .(K) denote the number of 
incidences of an i-face of K with a j-face. The object we count in two 
different ways is 

(*) OO 

Recall that when we changed the problem from that of finding a bound on the 
number of vertices to that of finding a bound on the number of facets of a 



polytope, we mentioned that it was the natural duality that made these 
problems equivalent. We again make use of this duality. A k-face (and its 
faces) is itself a k-polytope which satisfies Euler's formula. In the dual 
setting the k-face becomes a (d-l-k)-face of the dual polytope and each 
i-face of the k-face becomes a (d-l-i)-face of the dual which contains the 
(d-l-k) -face , Let h. (F) denote the number of i-faces of K incident 
with the k-face F of K. Because of this duality, these must satisfy 



the Euler relation 
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Zji=k ' “i 

Note that this is just Euler's formula when k = -1, F 

becomes f.(K). Using this formula we get 
^ .d-1 



0 and h^ (F) 



(*) = (-«’■ Z 

= E Ei=j 



where S indicates summation over all j-faces F. On the other hand, 
ing the same thing, but considering i-faces rather than j-faces we get 

,d-l 



(*) = £ I 
= £ > 

.,here S indicates suntaation over all i-faces . This last equality 

comes from answering the question "How many j-slmplices are determined by 
an i-face?" Since the reduction to simplicial polytopes assures us that 
each i-face is an i-simplex, it is clear that h^(F ) - 

these formulas are valid for each fixed j where -1 ^ j ^ d-2. Combining 



them we get the equations 

(**) Ai=:j ^ \j+iy i ^ J 

It would be nice here if these d equations in the d variables 
f ,f , would allow us to obtain algebraically as a function 

0 1 Q X 

of f . However they were obtained under the assumption that K was a 
simplicial polytope, and it turns out that only 2 ^ these equations 
are independent, ([z] denotes the greatest integer in a.) We state 
this result and the corresponding result about all d-polytopes in a more 



formal form. For each d-polytope K let f(K) denote the vector 
f(K) = 

6. The dimension of the affine hull of the set 

{f(K) 1 K is a d-polytope] is d-1. 

7. The dimension of the affine hull of the set 
[f(K) I K is a simplicial d-polytope] is [d/2]. 

Note that Euler's formula requires that the dimension of the flat 
in 6 be at most d-1, since each vector f(K) lies on the hyper plane 
H(u,a) c: where u = (+1,-1,+1>-1> • • • >(■!) ) € E and a - 1 - \ 1) 

Proofs of these results are omitted. 

Even though only about half of the equations (**) are independent, 
they will still be useful in the following form. It is possible to find 
independent equations giving each f^, i 5 [d/2] in terms of the 
f., i < [d/2]. For example in E^ these equations become 




in E^ the equations become 

f2 = 4f^ - lOf^ + 20 
fs = 5f^ - ISfQ + 30 

while in E the equations become 



fs = 3f2 - 5f^ + 5f(, 

S = *2 - 3^1 + 2fo- 

These equations were first found in 1905 by Max Dehn [1] for d - 4 and 5 
in a complicated way as the by-product of some other results. In 1927 
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d 

Sommer vi lie [l] worked out similar equations in E . These were ignored (even 
by Soramerville in his 1929 book L^j) the 1960 's. In 1962 Klee [3] 

independently found these equations and more besides. 

While the Dehn-Sommerville equations (**) will prove useful, they still 
do not allow us to find p,(v,d) for large values of d. What is now needed 
is to get some relation between f^ and f^ for all 0 ^ i < [d/2] so that 
the Dehn-Sommerville equations may be expressed in terms of f^ above. As a 
first (and apparently crude) estimate, note that each i-face has i+1 vertices 
and for two i-faces to be different they must differ in at least one vertex, 
so f ^ ^ for i gd. In a certain rather surprising sense these are 

also best possible when i < [d/2], namely, there exist certain polytopes, 
the cyclic polytopes among others, for which equality holds in these cases. 

We will use this fact, together with the Dehn-Sommerville equations to evaluate 
|j,(v,d) in some cases. But first we investigate the cyclic polytopes following 

Gale [ 1 , 2] . 

d 

Let t € E^ be a real parameter and define p(t) 6 E by 
p(t) = (t,t^,...,t‘^) . The set [p(t) | t is real} is called the momen t curv e 
in E^ If t <t <***<t are any d+1 or more distinct values of t, 
then the d-polytope C(v,d) = conv[p(t^ , . . .p(t^)} c: E is called a cyclic 
polytope . It may be shown that the combinatorial type of C(v,d) is not 

dependent upon the particular choices of the t^. 

8. Let C(v,d) be a cyclic polytope with any number v s d+1 of 

vertices. For any number n ^ [d/2], let = [p(tj^) , . . . ,p(t^)} be an 
arbitrary subset of n vertices of C(v,d) . Then determines an (n-1) -face 

of C(v,d). Therefore, f^(C(v,d)) = 0 ^ i < [d/2]. 

To show this we give the equation of a hyperplane H and note that each 
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oft ) e V lies on H, while all other vertices p(t) lie in one open half- 
space determined by H. Consider the polynomial 

rn 



o _ . 2n 



where t is the variable and each p. depends only on the constants 

which determine the points of V„. Note that are the only roots of 

the polynomial and the polynomial is non-negative. Thus if 

b = ^ “ '^0 " 

and equality holds if and only if t = t . for some i. Thus H(b,-Po) is 

the hyperplane, and the proof is complete. 

It is easy to show that the cyclic polytopes are simplicial polytopes. 

This is useful because it implies that the Dehn-Sommerville equations hold for 
cyclic polytopes. Furthermore, using just the properties of C(v,d), it is 
possible to calculate the number f. of i-dlmenslonal faces of C(v,d). 
in particular, the general formulas for the (d-1) -faces of cyclic polytopes are 

^ C") 

f,.l(C(v,d)) = if d=2n + l. 

in E* if we apply f^ S (’) = (2°) to the Dehn-Sommerville equations 



we get 

fj S ^Q(fg-3) 

and Theorem 8 shows that equality holds for the cyclic polytopes. This give= 
the complete solution to our problem if d = 4, and similar reasoning works 
when d = 5. To summarize: letting p^(v,d) equal the maximum possible 

number of k-faces in any d-polytope with v vertices, we have 
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9. \}.^(yA) = iv(v-3) 

li. 2(v,4) = v(v-3) 

lj, ^(v,4) = ^(v-1) 

Similarly, 

10. In E^, since = 2f^ - 6 £q + 12 and ^ ( 2 °) 

2 

equality holding for CCf^jS), it follows that ^^("^,5) v 7v + 1 . 

Analogous equations may be found for |J.|^(v,5), 1 = k ^ 3, Unfortunately, 

this method already breaks down if d = 6. We know that ^2 “ ( 3 ) 

fl - ( 2 ^) equality holds for cyclic polytopes) but these 

are not of use in the Dehn-Sommerville equation £5 = ^2 “ ^^1 

the f have alternating signs in this equation. For larger values of d 
i 

even the Dehn-Sommerville equations (**) get out of hand. It is possible 
to solve for f^_^^ in general in terms of the f^ for i < [d/2], and note 
that the coefficients of the f^ have alternating signs. But for values of 
k strictly between [d/2] and d-1, the coefficients of f^ in the repre 
sentation of f^^ are so complicated in general that no one has even proved 

that they have alternating signs. 

For this reason we now concentrate on finding iJi(v,d) - 
the larger values of d. The appropriate Dehn-Sommerville equation for this 
case is easier to express in the even and odd case separately; If d = 2n 



^2n-l 




n+i+1 




2n-2-i 

n-1 



) 



f. 

1 



and if d ss 2n+l 




Ci=-i 






we also kno« that ^ (i°i) If 0 S i < [d/2] in simplicial polytopes «ith 

equality holding for the cyclic polytopes. It was this latter result that 
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Klee generalized in 1962, thereby providing the major break-through that lets 
us determine M.(v,d) inmost cases. Klee's result is (see Klee [2]) 

11. For every simplicial polytope, is related to 

I 

the fornmla; (s+l)f^ S (£q - Futtherniore, this is independent of L 

the dimension of the polytope, and equality holds if and only if the polytope 
is (s+1) -neighborly (i.e., each (s+1) vertices determine an s-face) . 

This result is proved by the above process of counting things in two 
different ways, and it may he extended to the following relation between f^ 

and 

(s+1) I 

for 0 S j S s. In particular, equality holds in the above formulae for the 
cyclic polytopes when 0 S s < [d/2] (because cyclic polytopes ate [d/2]- 

neighborly) . 



As an example, we will illustrate how this result can be used to obtain an 
upper bound for f^ j^ for simplicial polytopes when d = 8. 



a) 


*7 = ^3 - - ^^0 


Dehn-Sommerville equation 


b) 




Theorem 11 for simplicial polytopes 


c) 


h ■ f^) 


a) and b) 


d) 


• (i - •^)('") * • “o 

if fp S 15 


f ^ and assuming that 

^i ■” \i+l/ 




(l - 1^^) a 0, i.e., f(, a 15. 


e) 


= f^(C(fo,8)) if fo a 15 


all inequalities are equalities in 






the case of cyclic polytopes . 



It is possible that the term (1 - 12/(f(,-3)) in d) might be negative, and 
thus make the estimate in the step to d) invalid. But this can only happen 
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1 




if <15. Hence the above proof is valid only if fQ S 15. Similar argu- 
ments work in the case of general d provided 

- 1 where d = 2n 

(n+1)^ - 2 where d = 2n+l 

For any particular value of d, these results can also be extended to get 

values of f for all intermediate values of k, provided, as in the above 

case, that f is sufficiently large compared with d. 

0 ^ 

The above arguments show that for all polytopes P in E with 

sufficiently many vertices, the maximum of is assumed when P is 

a cyclic polytope. The question now arises, when do the cyclic polytopes 
determine the maximum value of f^ (for fixed f^ and d) over all polytopes? 
The upper bound conjecture asserts that this is always the case. Stated in 
other terms, we know that f^(C(v,d)) S p,^(v,d); the upper bound conjecture 
asserts that equality always holds. The following theorem lists all the cases 
for which this conjecture has been established (by an elaboration of the above 

methods , or by other means) . 

12. The upper bound conjecture |J,^(v,d) = f^(C(v,d)) is true at 



least 


in 


the following cases: 






(1) 


for 


every k. 


1 ^ k ^ d-1, provided 


V 


is sufficiently 


(2) 


for 


every k , 


1 g k ^ d-1, provided 


V 


g d+3 


(3) 


for 


k = d-1 


provided 












V ^ - 2 


if 


d = 2n 








V ^ (n+1)^ - 3 


if 


d — 2n + 1 


(4) 


for 


k « [d/2] provided 












V 5 + 3n - 6) 


if 


d = 2n 








V g ^(n^ + 5n - 4) 


if 


d = 2n+l 






f ^ 
0 - 
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(5) for all (allowable) v and k provided d g 8 . 

The values of ii^(v,3) and p^(v,4) have been known since at least the 
first of this century. The cases for d = 5 and 6 were established in 1961 
The most important parts of Theorem 12 were established in separate papers by 
Klee [2] and Gale [3] in 1964. The upper bound conjecture itself is due to 
Motskin [1] who categorically stated in a 1957 research announcement that it 
was true. But since no detailed exposition has appeared in the intervening 
years it seems reasonable to refer to it as a conjecture. ^ 

/ 

/ 

/ 



Discussion . 

Johnson pointed out that the first argument of Theorem 4 is further com- 
plicated by the fact that there exist non-convex simply-connected polytopes 
in for which there is no simpllclal subdivision by diagonals . He also 

asked if Theorem 3 could be proved by use of Schlegel diagrams . (Def initxon: 

A Schlegel diagram of the d-polytope P is obtained by choosing a point x i P 
sufficiently near the interior of a facet F of P, and then projecting 
radially from k onto the facet F. This gives a (d-1) -dimensional representa- 
tion of the faces of P. all of which (except F Itself) appear in a cellular 

division of F. For example! 3 



2-dimensional Schlegel diagram of 
the cube in E is shown. (Note 
that it is what you see if you "peek 
in a face,") Griinbaum pointed out 
that the questionable assumption in the 
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early proofs of Euler *s formula (Theorem 3) is still far from obvious in general, 
even in this reduced 2-dimensional case. 

4 « 

Griinbaum also presented the following related problem posed by J, Steiner 
more than a century ago, which shows a difficulty in the idea of ’’slightly 



moving vertices.” Given a d-polytope P 
is combinatorially equivalent to P (i.< 
lattices) and which has all its vertices 
Bruckner stated the answer was obviously 
His false argument said: project P to 

be the convex hull of the projections of 
in that you may wish that four 
particular points on the sphere 
determine the triangles shown 
above, while in fact they 
determine the triangles shown 
below. This projection provides 
a homeomorphism but the question 
is whether the cells have affine 



can you find a d-polytope P’ which 
., S'(P) and Sf(P’) are isomorphic 

on a sphere? In his 1900 book, 

”yes” if the polytope was simpiicial. 
a containing sphere, and take P’ to 
the vertices. The main trouble occurs 




representation, and if this is made to be so, is the resulting figure still 
convex? In 1928 Steinitz [2] showed not only that Bruckner's proof is false, 
but even proved there are lots of polytopes (simpiicial ones among them) for 
which there does not exist any representation with vertices on a sphere, 
Steinitz 's proof is more readily explained in the dual formulation involving 
3-polytopes with an inscribed sphere. There is a wide class of 3-polytopes 
which do not have any realization with an inscribed sphere meeting each face. 



Start with any poly tope which has more vertices than faces, for example, the 
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cube. Slice off each vertex to form v new faces. If only a little is sliced 
off then these new faces will be pairwise disjoint. Assume such a polytope P 



has an inscribed sphere. Each edge 
subtends the same angle from the 
two points of tangency in the 
adjoining faces. But since there 
are more "newer" faces than "old" 
the sum of the subtended angles in 
the new faces must be more than the 
implies that two "new" faces must have 




le sum for the old faces. But this 
an edge in common, a contradiction. The 



dual of any such example becomes an example where the vertices cannot be in- 



scribed in a sphere. 

A different approach establishes the existence of a family of simple 
3-polytopes which have no combinatorially equivalent representative inscribed 

3 

in a sphere. To explain this approach, consider K, a cube in E with 
one vertex sliced off. If we "peek in" the triangular face, the resulting 
Schlegel diagram for this polytope is clearly seen to be 




H 
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Assume that this sliced cube had each vertex on a sphere S. Also assume that 
the projection used to obtain the figure above was of the following special sort 
Take a point z on S above the triangle, and take the usual stereographic 




projection onto the opposite supporting plane P. This stereographic projec- 
tion sends circles on S (not containing z) onto circles in P. The quadri- 
lateral ADGF c: P is thus inscribed in a circle (the image of the circle which 
is the intersection of S with the corresponding facet of P) . Similarly the 
quadrilaterals EDGE and CEGF are inscribed in circles, as are the pentagons 
ADBIJ, BECHI, AFCHJ. Using the fact that opposite angles of any quadrilateral 
inscribed in a circle add up to 180°, it follows that D, E, F lie on the 
sides of the triangle ABC. Thus, for example. A, D, B are collinear, a 
contradiction to their being on a (proper) circle. This proves that no poly- 
tope with the combinatorial type of K can have its vertices on a sphere. 

Yale asked what the motivation was for using the word "cyclic" in cyclic 
polytopes. The moment curve is not the only curve which can be used to generate 
neighborly polytopes. Another curve with this property and described in terms 



2n 

of the real parameter t is the set of all points in E of the form 
(sin t, cos t, sin 2t , cos 2t , . . . , sin nt, cos nt) . This curve was used sixty 
years ago by Caratheodory in his studies of analytic functions. Notice that it 
is a bounded, periodic curve, i.e., it "cycles" in E^*^. Caratheodory* s work 
with cyclic polytopes was forgotten until the 1950* s, when Gale [1, 2] first 
independently described neighborly polytopes in a complicated way, rediscovered 



Caratheodory *s work, and then went beyond Caratheodory to get simple proofs to 
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further results on cyclic and neighborly polytopes. The names are due to Gale 
Coxeter suggested that the mathematical community should read as 

"a choose b" rather than "a over b." 

Theorem 11 of Klee is valid for all simplicial complexes. It is still an 
open problem as to whether this can be extended to non-simplicial polytopes. 
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11, Related results and problems . 



In the last section we worked directly toward an answer to the problem 

•'What is ij,^ j^(v,d)?*^ In answering that question we presented many results 

which lead to interesting generalizations or problems on their own. Several 

of these are discussed in the present section. 

We first consider various generalizations of Euler's formula. There are 

many similar equations associated with other characteristics of polytopes. 

The first example we shall give is a generalization of a fact known to Euclid; 

2 

the sum of the interior angles of an n-gon in E is (n-2)iT, or (n-2)/2 
full angles. Let P be a d-polytope, F any k-face of P, 0 ^ k ^ d, 
and denote by c(F,P) the convex cone spanned by P from the vertex at the 
centroid of F, Let cp(F) be the fraction of E^ taken up by the cone 
c(F,P), (Intersect c(F,P) with the unit sphere whose center is at the 
vertex of c(F,P) to make this definition precise,) In the planar case 
where P is a convex polygon, 



i l if F = P 

^ if F is an edge 

(the interior angle at F)/2rr if F is a vertex, 

t tl 

For each k = 0,l,,,,,d, define the k — angle sum by 

Q'^(P) = S cp (F) where the sum is taken over all k-faces F of P, 

Thus the result known to Euclid could be rephrased; for all 2-poly topes P, 
cVq(P) = (fQ"2)/2 = (f^-2)/2 s= Q'j^(P) - 1, This result has the following 
generalization , 

1, For every d-polytope P, 

The special case d = 3 was discovered by Gram in 1874, It is well- 

3 

known that in E , if all faces of P are of the same type, then they must 
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be either triangles or quadrilaterals or pentagons. No generalization of this 
fact for higher dimensions was known until recently when Perles and Shephard 
[1] obtained results of this type using these angle sums of^(P) . Even if we 
know that all facets of a 4-polytope are 3-polytopes of a particular combina- 
torial type, there is no bound on the number of vertices of these 3-polytopes 
in different examples. Perles and Shephard showed the possibility of using 
certain classes of polytopes as facets and the impossibility of so using 
certain other classes. 

It is interesting to note that the equation of Theorem 1 is unique in 
the following sense. 

2. If (-l)^p.Qf.(P) = ^.(-1)^^“^ holds for all d-polytopes P, 

then = ^ ^d* 

We now turn to an example of a Euler- type equation involving d-polytopes 
which is a vector equation rather than a scalar equation. For each vertex 
X 6 P, consider the cone generated by the outward normals to all hyperplanes 
which support P at x. As in the last example, consider the content \j/(x) 
of this exterior cone. For obvious geometrical reasons \Kx) is called the 
exterior angle of P at x. We form the vector sum of these vertex points 
using the content of the cones as weights, and define the resulting vector 
S(P) to be the Steiner point of the polytope P. That is, 

S(P) » 2 tCx) X (summed over all 0-faces x) , 

The most useful properties of S(P) are given by 

3, (a) S(P) depends upon neither the dimension of the space which 

contains P nor the location of the origin in that space, (b) For all real 

and all polytopes P,Q, 

S (aP + pQ) = o?S (P) + pS (Q) 
wh6iT6 addition on tho loft is voctoir addxtxon^ 
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Now each i-face 0 S i g d-1, is also itself a polytope and therefore 
admits its own Steiner point S(F). For each i, 0 ^ i ^ d-1, let us define 
a vector by 

|jb^(P) = Z S(F) (summed over all i-faces F of P). 

Note that [jb. (P) definitely depends upon the location of the origin and in 
general will not be in the polyhedron P. Yet taken together they satisfy 
the Euler-type vector equation 

4, (-l)V. = (1 + (-l)^‘bs(E). 

1=0 

It has recently been shown by Sallee [l] that the Steiner points also have a 
nice valuation property, namely, when P^^ are poly topes and P^ U P 2 is 

convex S(P^) + S(p 2 ) = ^ 2 ^ = S(P^ U P 2 ) + ^ ^ 2 ^ * 

The points S(P) have been called Steiner points because in 1840 
J, Steiner proposed studying the center of gravity of a mass distribution on 
a curve in where the mass at a point on the curve is proportional to 

the curvature at the point. The Steiner point as defined here for a polytope 
is the discrete analog of the center of gravity of such a mass distribution. 

It might be noted that if the support function of a convex set is 
developed in a Fourier series, then a translation of the origin changes only 
the first-order coefficients of the Fourier series. If the origin is trans- 
lated to the Steiner point then these first order coefficients will be zero. 
Similarly in higher dimensions, if the support function is developed in 
spherical harmonics of the appropriate dimension, the origin is at the 
Steiner point if and only if the first order terms are zero. Since the 
supporting functions add when convex sets are added, it is thus reasonable 

to expect that S(P *f Q) = S(P) -f- S(Q) . 

* * * * 
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We next turn to a generalization of Klee's inequality (Theorem 10,11) 
which is valid for all simplicial complexes rather than just those simplicial 
complexes realized by simplicial polytopes. If a given simplicial complex 
has f k-faces, it is natural to ask what is the best possible lower 

iC 

[respectively, upper] bound on the number f^ of i-faces of the complex, 

when i ^ k [respectively, i 5 k] . This problem has been solved by J. B. 

Kruskal [l] using a rather complicated proof. We will illustrate Kruskal's 

result by an example. Suppose it is known that in a given simplicial complex 

f, =30, We write f, in a certain canonical form: 

4 4 



*4 = (s) + (4) + (3) + it) (l)- 

In general the canonical form of f^^ starts with + 



• • • • 



We 



choose the constants a,b,c,... as follows. Choose a as large as possible 
so that f^ ^ Having chosen a (a = 7 in this example) choose b as 

large as possible so that f^ § this process until 

equality is reached at some stage. Thus f^ = 30 = + Q) is a 

canonical representation. Then for i, the best possible bounds on f^ are 



= (i+i) + (D + (i-i) 



given 

as long as the representation of f^. (Terms like zero terms, 

j S 1.) Thus in our example. 



where this representation continues 
n 

n+j- 



■ (I) - © - © ■ « 

and 

«e ■ Q - © * © ■ © ■ 

It must be emphasized that these are the best possible bounds for the class of 
all simplicial complexes. Even though they are bounds for the f^ in terms 
of f, for poly topes, they are generally not the best bounds. Nevertheless, 

tC 

Kruskal's result has proved its usefulness as a tool in the theory of polytopes 
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by helping to establish several cases of the upper bound conjecture. In 
particular it allows us to establish the upper bound conjecture in the case 
d =: 7, and the case d = 8 follows from the case d = 7 by a rather simple 

argument . 

It would be interesting to know how Kruskal’s results might be extended 



when extra geometric conditions are added. We know that we can embed 
topologically* or even rectilinearly,. each simplicial k-complex in E 



But suppose that a simplicial k-*complex can be embedded in 



adds restrictions which conceivably can be used to obtain 

the f . . 

1 



j.k+1 ^ g 

better bounds on 



* * * * 

A problem similar to the one considered in Section 10 is, Given a 
d-polytope with v vertices, how small may f^ be? The answer is easy 
if d = 3, and is known for most cases if k = d-1, but almost nothing 
is known for intermediate values of k. We first show that as much is 
known about the lower bounds on 

f 

^d" 

as the upper bounds . For a fixed dimen- 
sion d, plot £q > d against f^^^, 
placing a dot on the graph whenever 
there is a d-polytope having this 
number of vertices and facets. It is 
clear from the duality which takes 

f into f j , that this graph must be 
0 d-1 

symmetric about the diagonal. Thus the lower 
from knowing the upper bounds on This 

f In the dual setting we are then asking 



^ « .1 . ■ a— f Q 

bounds on f^^^^ can be found 
technique does not work for 
for upper bounds on 
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terms of the number of edges. Thus these intermediate cases for k < d-1 

do not reduce to the results of Section 10. 

For the upper bounds on f^ ^ we saw that it was sufficient to look at 

only simplicial polytopes , It would be nice to get a similarly restricted 

class of polytopes in which the lower bounds are always assumed. Unfortunately 

the simplicial polytopes do not have this property. In fact no reasonable 

conjecture has even been made as to a class of d-polytopes which minimize 

f (P), Nevertheless, it is still interesting to restrict our attention to the 
k 

simplicial polytopes and ask for the lower bounds on fj^ there. A certain 
adding process which builds up simplicial poly topes in E has lead to 
specific conjectures in as to what the lower bound for f^_^ should be 

for simplicial polytopes. In particular, suppose that a particular simplicial 
polytope in E^ with a given number of vertices minimizes If we add 

an extra vertex we must add at least d edges, and similarly at least (d-1) 
facets. This process has lead to the lower bound conjecture ; If Vj^(v,d) 
is the best lower bound on fj^ for all simplicial d-polytopes with v 
vertices, then 

= (O’' ■ (Oiy 

Vd j^(v,d) = (d-l)v - (d+1) (d-2) , 

This is easily checked in E , but in E it is not at all clear that the 
process of adding an extra vertex gives analogous results. The lower bound 
conjecture has, in fact, only been proved for those cases where an actual 
enumeration was made for all polytopes with this number of vertices. Perles 
has a few days ago reported that the lower bound conjecture is true whenever 
there are at most d+9 vertices, thus in particular when Vq ^ 13. The 
methods used do not seem to allow for significant improvement, 

* * * * 



o 
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One immediate application of results of the type of Section 10 is to the 
coloring problems. The famous 4-color problem has been generalized in 
different ways to higher dimensions, and also to surfaces other than the 
sphere in E^. In the higher dimensions most formulations of the problem do 
not admit upper bounds in the sense that no finite number of colors is 
sufficient to color all examples. The coloring problem generally turns out 
to be solvable on surfaces in E^ other than the sphere. A minor reduction 
of the 4-color problem is easily obtained, namely it is equivalent to the 
coloring of the 2-faces of all simple 3-polytopes (i^e., polytopes with each 
vertex trivalent) . 

The duals of the cyclic polytopes in E^ show that there are 4-polytopes 
with as many 3-faces as desired, each two having a common 2-face. The Schlegel 
diagram in E of each such 4-polytope gives an example of as many 3- 

O 

poly topes as desired in E , properly meeting at their boundaries, and each 
two of them will have a 2-face in common. This was independently discovered 
by Eggleston, Besicovitch, and Rado about fifteen years ago. 

There is, however, at least one meaningful variant of the 4-color problem 
for d-poly topes which has a non- trivial solution, namely; 

5. The 2-faces of any simple d-polytopes (d U 4) may be colored 
using at most 6d - 12 colors. (The coloring is supposed to assign different 
colors to pairs of 2-faces which have a common edge.) 

An open problem in this area that appears to be very difficult is the 
following. It is well-known that maps in 2-manifolds in E^ may need a large 
number of colors if the genus is sufficiently large. If we consider manifolds 
that are not just topological, but also locally-affine in the sense that each 
“country" is a planar convex polygon, then there might be an upper limit on 
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the number of colors necessary, independent of the genus of the manifold. A 
very shakey guess would be that this universal upper limit on the number of 

necessary colors is 6. 



Discussion . 

Since many of the results of the last two sections concern cell complexes , 
there was a discussion on how this is related to algebraic topology. Grunbaum 
pointed out that usually a topologist is not interested in any particum cell 
decomposition, but rather quickly passes to subdivisions and uses them as a tool. 
On the other hand a graph theorist will tend to consider one graph or complex 
and analyze its particular structure. Perhaps something could be done between 
these two positions, but many such problems are quite hard. For example, no 
reasonable conditions on a 2-complex are known v*ich guarantee that it can be 
embedded in a space of dimension less than 5 . Each 2-complex may be embedded 
in E^ , and if it is also a manifold then it may be embedded in E . But it 
would be nice to know, for example, which 2-complexes may be embedded in E^ 
both topologically and piece-wise linearly. Note that even though a refine- 
n.ent of a 2-complex may be geometrically as well as topologically embedded in 
E^ this does not imply that the original 2-complex is so embedable. This is 
in contrast to the one-dimensional case, where every 1-complex may be embedded 
in E^, and if it goes into ^ as well, then it may be embedded geometri- 

caily, that is with straight line segments. 

AS an example, consider a 2-c<xaplex which is the union of 9 hexagons as 
shown below. This manifold is topologically a torus, with the similarly labeled 
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vertices identified in the obvious way. It is easy to show that if this can 

be topologically and geometrically embedded (with "flat" hexagons) in any 

3 

higher dimensional space, then it can be embedded in E . Assuming it 



3 

is embedded in E , let us count the 
sum of the interior face angles of the 
hexagons in two different ways , The 
sum of the face angles in each hexagon 

X 

is 4tt. Thus the sum of all interior 
face angles is (4tt)9 = 36tt. On the 
other hand, 6 • (jf’ of hexagons) = 

3(# of vertices) , by counting the 
number of edges in two different ways . 
Also, the sum of the interior face 
angles around a vertex is at most 
2 tt and equals 2ir only if the 
3 hexagons at that vertex lie in a 
plane. Combining these results, we 
can conclude that all the hexagons 
must be in one plane, a contradiction. 
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12, Polytopes in , 

Much more is known about polytopes in than about higher dimensional 

polytopes, in part since it is actually possible to construct models of them. 

It is much easier to guess what the solutions of a problem should be when there 
is an actual model to look at. Further, 3-polytopes actually do have a simpler 
structure than 4-polytopes, just as 1-manifolds are simpler than higher dimen- 

sional manifolds. 

Part of our familarity with 3-polytopes is due to the possibility of 
representing them in the plane. The results and methods of graph theory may 
be applied to the 1-skeleton (set of vertices and edges) of the Schlegel dia- 
gram of a 3-polytope. It is clearly seen that the Schlegel diagram of a 3- 
polytope is planar. The converse question was open for a long time, namely; 
which configurations in E^ of convex polygons meeting properly on their edges 

will be Schlegel diagrams of 3-poly- 
topes? We want each such configura- 
tion to have at least four vertices, 
and each vertex must be at least 3- 
valent. Further properties are 
necessary, since the graph in this 
figure can obviously not represent all 
but one of the faces of a 3-polytope. 
In this section a graph means a finite set of nodes (vertices) and a set 
of edges which connect certain pairs of these nodes . A graph is n-connyt ed 
if for each pair x,y of vertices there are at least n disjoint paths along 
the edges which connect x to y. (Two such paths are disjoint if they have 
only X and y in common.) Equivalently, a graph with at least n + 1 
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nodes is n-connected if any n~l or fewer nodes and their adjacent edges may 
be removed, and the graph remains connected. For example, the graph with solid 
lines is not 3-connected because the removal of x and y and their adjacent 

edges leaves u and v in separate 
components o Equivalently, each path 
from u to V goes through either 
X or y, and thus there are not 
three distinct such paths. However, 
if the dotted line is added as an edge, 
then the graph becomes 3~connected , 

We say that a graph is 3-realizable 
if it is isomorphic to the graph of 
some 3-polytope. 

Another well-known property of the graph of every 3-polytope is that each 
such graph is 3-connected. About fifty years ago E. Steinitz gave several 
proofs of the converse statement which is one of the most important and deepest 
results concerning 3-polytopes, Unfortunately his work was forgotten until 
comparatively recently. 

1. (E. Steinitz [l]) . Every planar, 3-connected graph is 3-realizable. 

An immediate corollary is 

2, Each planar 3-connected graph may be "stretched" into a shape where 
all its edges are straight and all bounded regions in the plane which it 
determines are convex polygons . (We interpret our definition of graph to 
exclude two distinct edges connecting a pair of vertices, and the ends of each 
edge must be distinct vertices.) 
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The proof of Theorem 1 is very difficult. We x^ill just present its main 
ideas and point out the difficulties. 

For a graph to be 3-connected, it clearly must have at least six edges, 
and it will have exactly six edges only when it is the complete graph on four 
nodes. Since we observe that this graph is 3-realizable (by the tetrahedron), 
we have started our induction proof on the number of edges of the graph. Hence, 
assume that we are given a graph with at least seven edges. To sketch how the 
proof will go, we first establish that the graph has 3-valent faces or vertices. 
Secondly, we will list two ways to transform a graph G into a graph G' such 
that if G' is 3-realizable, then a 3-poly tope P may be constructed which 
realizes the graph G. These transformations will be called elementary trans- 
formations. Thirdly, we will show that if a 3-connected graph G has a 3- 
valent vertex on a triangular face, then there exists an elementary transfor- 
mation of G to a 3-connected graph G' with fewer edges than G. The 
induction hypothesis implies G* is 3-realizable and hence G is 3-realizable. 
Fourthly, if G does not have a 3-valent vertex on a triangular face, then a 
finite sequence of elementary transformations of G yields such a graph, in 
which case the above argument applies, and the proof is complete. 

(1) Let V, e, p denote respectively the number of vertices, edges and 
facets of a 3-polytope P. Let v^, p^^ denote the number of vertices or 
facets, respectively with exactly k incident edges. Thus v = v^ and 

p = S, « P. j so Euler's formula may be given the form 

It is also clear that 

SkS3 = \S3 
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or 




Combining these two equations we get 



V 3 + ?3 = 8 + (k-4) (Vj^ + p^) S 8 , 



Thus we conclude that each 3-polytope has at least eight 3-valent elements 
(i.e., 3-valent vertices or triangular faces). Since this argument applies to 
graphs on the 2 -sphere as well, it also applies to planar 3 -connocted graphs, 
(2) If a 3-connected graph G has a triangular face, as shown on the 
left below, then we define an elementary transformation T] on G to obtain 
a graph G* = TIG by removing the three edges of the triangular face and adding 
one new vertex which is then connected to the three vertices which formed the 
triangular face; if a vertex of G* is 2-valent, we replace it and the two 
edges incident to it by a single edge, A typical 7 ]- transformation is shown 
below. 



The graph G' is easily seen to be 3-connected, If G' is 3-realizable by a 
3 -polytope P, it is clear that the "new" vertex may be "sliced off" by a 
plane through three points corresponding to the three old vertices, thus obtain- 
ing a realization of G, Note that if each vertex of the triangular face is at 
least 4 -valent, as shown above, then the number of edges remains the same. 
However, if one or more of the vertices is 3-valent, as in the example shown 
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below, then the number of edges may be reduced by at least one. 




We also define a second type of elementary transformation. A trivalent node 
and the three edges incident to it are removed from G, and the nodes con- 
nected to it are pairwise connected by "new" edges (unless some of them are 
already edges in G, in which case there is no need for the "new" edge). 
Examples of such a transformation OJ to get a new graph G* are shown below. 



r. 




If P is a 3-polytope which realises G' f'-.n f faces Pj^P2^3 are "extended" 
to get a realization of G. As the second example shoao, a new face might also 
need to be formed. It can be shown that it is always possible to obtain a 





realization of G from any realization of G'. In each case, an elementary 
transformation does not increase the number of edges. 

(3) If a triangular face has at least one 3-valent vertex, then the 
transformation T] reduces the number of edges. In this case the induction 
completes the proof. 

(4) The proof of the fact that some finite sequence of elementary trans- 
formations gets us to case (3) is technical and is omitted. It uses special 
properties of 4-valent, planar graphs. This completes our sketch of the proof. 

Note that Theorem 1 characterizes which graphs are isomorphic to the graphs 
of 3-polytopes. It is also possible to characterize in graph theoretic terms 
graphs which are isomorphic to graphs of centrally-symmetric 3-poly topes, or to 
graphs of 3-polytopes with a plane of symmetry. We give one other corollary. 

3. Every 3-polytope may be obtained from a tetrahedron by a finite 
sequence of applications of two particular types of operations. (Each operation 
admits several different cases , as we saw abovu J 

The next natural question concerns generalizations of Steinitz*s theorem 
to higher dimensions. There are not even any conjectures here about what the 
conditions in such a theorem should be. For example, suppose a 3-polytope is 
partitioned into the union of disjoint 3 -dimensional convex poly topes which 
meet properly along their boundaries, and which have appropriate connectedness 
properties. It is reasonable to ask, does there exist a 4-polytope for which 
this is its 3-dimensional Schlegel diagram? Such configurations must have 
certain obvious properties, but there exist examples which satisfy all such 
known properties, but which are not Schlegel diagrams of any 4-polytope. The 
smallest simplicial example of this consists of just eight vertices; i.e., a 
tetrahedron with only four additional interior vertices which define a partition 
into 19 other simplices. 
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The 4-color problem concerns the coloring of connected planar regions. If 
the graph associated with the regions is 3-connected, then Steinitz's theorem 
implies that it is the graph of a 3-polytope, and coloring the planar regions 
would be equivalent to coloring the facets of the polytope, (A coloring assumes 
that regions or faces with a common edge will have different colors, and asks 
for the smallest number of necessary colors,) If the planar graph is not 
3-connected, then easy arguments show that there are two subgraphs whose color- 
ing would imply the coloring of the whole graph. For example, to color the 
graph at the left-, it would suffice to color the two graphs at the right. 






The corollary below therefore follows from Steinitz*s theorem, although other 
easier proofs exist, 

4, The 4-coloring problem in the plane is equivalent to the co: responding 
4-color problem for the faces of 3-polytopes, 

* * * * 

For about 100 years, people have been trying to count the number of 
different combinatorial types of 3-polytopes with v vertices or p faces. 

The problem is hard, and most early papers were attempts to draw pictures, 
count them, and hope that none were missed. The problem has been significantly 
advanced recently, and it seems reasonable to believe that within a few years it 

3 

will be solved, at least for the simple polytopes, (P C E is simple if each 
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vertex is 3-valent.) 

We now consider a simpler problem which was solved by Eberhard [l] in 
1891. As above, let Pj^(P) number of k-gons in a 3-polytope P, and 

let p be the total number of facets (2-faces) , We say that a sequence 
(p^jP^,...) of non-negative integers is 3-realizable provided there is a 
simple 3-polytope P such that pj^ = ^k^^^ * simple 3-polytopes 

V *= V3, we have 

3v = 2e = kp^. 

Together with Euler *s formula v + p = e + 2 and 



p = Pk 

we get the following theorem, 

5, A necessary condition for a sequence (p^*..,) to be 3-realizable is 

(*) 3p3 + 2p^ + P3 = 12 + (k-6)p^. 

It is interesting that (*) implies that there must be at least four 

faces with fivo or fewer edges, but it says nothing at all about the number 

of faces with six edges. Examples show that two sequences may differ only in 

Pg and yet one is 3-realizable while the other is not. This leads to the 

« 

following question. Given a sequence (P3>P^»P2 »Py » • • •) which satisfies (*) , 
does there exist a value of p^ which makes (P3»p^»P3 >P^»Py » . . .) 3-realizable? 
Eberhard *s theorem says ”yes," 

6. If (P2>P^>P5»P7 > • • •) satisfies (*) , then there is some p^ which 
makes the sequence 3-realizable, 

The proof of Eberhard was quite hard, but by using Steinitz's theorem the 
proof can be simplified. Rather than prove this theorem we will prove another 
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related theorem which is easier, and yet which shows the main idea of the proof. 
If we work with 4-valent polytopes rather than simple polytopes we must use 
4v = 2e rather than 3v = 2e in the above equations. Similar reasoning then 
yields 

7, If each vertex of P is 4-valefot, then 

(**) P 3 = ® 0s-4)Pj^. 

In an analogous way, this leads us to 

8, (Griinbaum) If (p 3 ,P 3 sPg, . . .) satisfies (**) , then there is some 
p^ and some 4-valent polytope P such that P has p^ = Pj^ (P) for all k. 

The proof is accomplished by actually constructing a 3-connected, planar 
graph, with each vertex of valence 4, and with exactly p^^ faces with k 
edges. Then Steinitz*s theorem supnlies the desired 3-polytope P, Thus we 
need only construct the graph. Each k-gon that we need, k ^ 5, is formed 
in the following way from the inside of a square. 



6-gon 

It is clear that each such square produces one k-gon, (k-4) triangles, 
and a number of quadrilaterals , Thus if we use these squares as building 
blocks of our graph, we will preserve the ratios of p^ and p^*s in (**) • 
These building blocks are placed together as shown on the left below. They 
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are used as the "diagonal" of a large basic square, and corresponding edge 
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Basic square 




points of the building blocks are connected to the edge of the basic square 
as shown* vJe now have a graph which is 4-valent at each vertex except on the 
boundary of the basic square, and which contains exactly pj^ k-gonal regions. 
To take care of the edge vertices, we close up the basic square with circular 
arcs, as shown on the right above. This makes each vertex 4-valent, adds lots 
of quadrilaterals and exactly eight triangular regions as shown. Thus the 
formula (**) is satisfied by the p^^’s of this graph and the proof of the 
theorem is complete. 

A similar type of construction is applicable to the proof of Eberhard’s 
theorem; the details are much more involved, mainly because of the presence of 
three kinds of "small faces," No result analogous to Eberhard's theorem is 
known concerning 5-valent polytopes; the equation corresponding to (^) or 
(**) does not have any p^^ missing. 










It would be interesting to know what values of can appear in 

Eberhard*s theorem, and how small the value of p^ may be. It was earlier 
conjectured that perhaps p^ ^ n where p^ is the last non-zero member of 
the given sequence. A recent result of Barnette has disproved this conjecture. 
It has not disproved a conjecture that there is a constant c such that 
Pg ^ c Pjj* Examples show that c must be bigger than one in this con- 

jecture; probably the best constant is c = 3, The 1966 result of Barnette 
referred to is: 

11. If P is a simple polytope, then 

2?3 + 2p^ + 2p^ + 2pg + p^ ^ 8 + (k-8)Pj^. 

In particular, this equation may be used to obtain a lower bound on the 
size of p, , There is no similar result known for 4-valent polytopes. 



Discussion . 

A discussion of the proof of Steinitz*s theorem brought out the following 
facts: 

(a) We need to apply the elementary transformation T] to a triangular 
face and not a face with more sides. If T]* were the transformation shown 

m 

below, for example, and if P were a 3-polytope realizing the graph G* 
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(shown in part on the right) , then there is no way to assure that the new 
vertex may be "sliced off" with a plane through the four old vertices, since 
they may fail to be coplanar . 

(b) In the fourth part, of the proof of Steinitz's theorem, we cannot take 
a particular trivalent vertex and triangular face and guarantee that they can 
be brought together by a finite sequence of elementary transformations . We 
only know that such a sequence suffices to bring some pair of 3-valent elements 
together. The description of this process does, however, give an algorithm by 
which this can actually be accomplished in a finite number of steps; it is not 
just an existence statement. 

(c) Corollary 2 of Steinitz's theorem may be proved directly with less 
work than Steinitz's theorem, but it is not a simple fact, and false proofs 



have been published for it even recently. Direct proofs use only the graph 
theoretic properties; it does not help to keep track of which bounded regions 
are convex as the induction proceeds . Yale pointed^ out that this was remark- 
able in itself, as it supplies the first known admission by Griinbaum that con- 
vexity was not a useful property. 

In the discussion it was pointed out that bounds on p^ in Eberhard's 
theorem are known for certain special cases. For example, if p^ == then 



p must be zero or some even number greater than or equal to 4. If p» — 6, 
r6 

then p must be zero or at least 2. Some similar conditions imply that p- 
ma y only assume odd values. It was also remarked that Eberhard was blind 



since his youth. 



13 « Polytopes with rational vertices « 

We now turn to a topic in integer programming which extends some of the 
ideas we discussed after Section 10. I first heard of the following problem 
from Klee. For any d-polytope P, does there exist a polytope P of the 
same combinatorial type which has all of its vertices at rational points in 
E^? Equivalently, we could ask, could all the coordinates of each vertex be 
Integers? Until a few years ago there was nothing in the literature which 
answers this. The answer to the question is clearly "yes" in E , Is "yes" 
but not obviously so in E^, is unknown in dimensions 4 through 7, and is "no" 
in E^. If we start with a rational simplex (rational here means that the 
vertices have rational coordinates) in E^ and continue to transform it by the 
elementary transformations of Steinitz's theorem as in Corollary 12.3 above, 
then it is no loss of generality to assume that each transformed polytope is 
again rational. Thus the problem in E^ can be solved using the proof of 
Steinitz's theorem. In E^ it is always possible to get a rational simplex, 
or a rational realization of any simplicial polytope, but in general "moving 
a vertex a little" might change the combinatorial type of P even if 

p(P,P') g e in the Hausdorff metric. 

Perles (see Grunbaum [1], Section 5.5) has given an example of a (convex) 
8-polytope with twelve vertices which has no rational realization. This is 
the best example possible In E® In the sense that If P c E*^ has at most 
d+3 vertices, then It has a rational realization. The Irrationalities of 
Penes' example are not too bad In the sense that we could realize all coordi- 
nates of Its vertices In the field which Is the simple extension of the 
ratlonals by /I. More generally. It Is possible to obtain realizations of 

g 

all d-polytopes using the field of real algebraic numbers. Even In E , 
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though, it is not known whether every polytope may be realized using a field 
which is an extension of the rationale by a finite number of irrationals. 

Our next topic will show that this type of occurrence is not too surpris- 
ing. We will consider arrangements of lines in the projective plane (or 
arrangements of hyperplanes in projective d-space) . By an arrangement we mean 
a set of lines ia the projective plane which do not form a pencil at one point. 
We are interested in questions like, what are the regions into which the plane 
is divided, how many vertices are there, etc. We can ask when any two arrange- 
ments are combine tor ially equivalent in the sense that the equivalence pre- 
serves the number of lines and vertices and their incidences. In particular 
we can ask if every arrangement has an equivalent rational arrangement; i.e., 
an arrangement where the coordinates of all vertices are rational. 

In answer to the last question, the arrangement of ten lines and eleven 

vertices as shown cannot have a 

rational realization. It can be 

shown that any arrangement combinator- 

ially equivalent to this example is 

already projectively equivalent to it. 

Hence the cross-ratio (A,B;C,D) is 

irrational in each realization. The example is actually larger than necessary. 

The subarrangement of nine points and 

nine lines as shown has the same 

property, and is best possible in the 

2 

sense that any arrangement in E 
with at most eight points or eight 
lines has a rational realization. 
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Perles' example in E®, referred to above, makes use of this last 
example. His argument uses a particular transformation which maps a d-dimen 
sional arrangement with v vertices into (v-d-1) -dimensional space. We 
have used the word "arrangement" here rather than "configuration," since a 
configuration is usually considered to have a constant number of points on 
each line. 



Discussion . 

It was pointed out during a discussion that the two examples in E 
which do not have rational realizations, can in fact be realized in the 
2-dimensional vector space over the field R(/S) (i.e., the simple extension 

of the rationals hy /s ) , On the other hand, they cannot be realized if the 

field is R(/2) . 

'Hhe figure with ten lines and eleven vertices is "self- conjugate" in the 
sense that any combinatorially equivalent arrangement is either essentially 
the same or is one where the five vertices on the points exchange places with 

the five vertices on the inner pentagon. 
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